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2 Introduction 


Calculus 1 2 3 

2.1 What is calculus? 

Calculus is the branch of mathematics dealing with instantaneous rates of change of continu- 
ously varying quantities. For example, consider a moving car. It is possible to create a function 
describing the displacement of the car (where it is located in relation to a reference point) at any 
point in time as well as a function describing the velocity (speed and direction of movement) of 
the car at any point in time. If the car were traveling at a constant velocity, then algebra would 
be sufficient to determine the position of the car at any time; if the velocity is unknown but still 
constant, the position of the car could be used (along with the time) to find the velocity. 

However, the velocity of a car cannot jump from zero to 35 miles per hour at the beginning of 
a trip, stay constant throughout, and then jump back to zero at the end. As the accelerator is 
pressed down, the velocity rises gradually, and usually not at a constant rate (i.e., the driver may 
push on the gas pedal harder at the beginning, in order to speed up). Describing such motion 
and finding velocities and distances at particular times cannot be done using methods taught in 
pre-calculus, but it is not only possible but straightforward with calculus. 

Calculus has two basic applications: differential calculus 1 and integral calculus'' . The simplest 
introduction to differential calculus involves an explicit series of numbers. Given the series (42, 
43, 3, 18, 34), the differential of this series would be (1, -40, 15, 16). The new series is derived from 
the difference of successive numbers which gives rise to its name "differential". Rarely, if ever, 
are differentials used on an explicit series of numbers as done here. Instead, they are derived 
from a series of numbers defined by a continuous function which are described later. 

Integral calculus, like differential calculus, can also be introduced via series of numbers. Notice 
that in the previous example, the original series can almost be derived solely from its differen- 
tial. Instead of taking the difference, however, integration involves taking the sum. Given the 
first number of the original series, 42 in this case, the rest of the original series can be derived by 
adding each successive number in its differential (42+1, 43-40, 3+15, 18+16). Note that knowl- 
edge of the first number in the original series is crucial in deriving the integral. As with differ- 
entials, integration is performed on continuous functions rather than explicit series of numbers, 
but the concept is still the same. Integral calculus allows us to calculate the area under a curve of 
almost any shape; in the car example, this enables you to find the displacement of the car based 


1 http ://en. wikipedia.org/wiki/Calculus 

2 http://en.wikibooks.org/wiki/Calculus%2FDifferentiation%2FContents 

3 http://en.wikibooks.org/wiki/Calculus%2FIntegration%2FContents 
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on the velocity curve. This is because the area under the curve is the total distance moved, as we 
will soon see. 


2.2 Why learn calculus? 


Calculus is essential for many areas of science and engineering. Both make heavy use of math- 
ematical functions to describe and predict physical phenomena that are subject to continual 
change, and this requires the use of calculus. Take our car example: if you want to design cars, 
you need to know how to calculate forces, velocities, accelerations, and positions. All require 
calculus. Calculus is also necessary to study the motion of gases and particles, the interaction of 
forces, and the transfer of energy. It is also useful in business whenever rates are involved. For ex- 
ample, equations involving interest or supply and demand curves are grounded in the language 
of calculus. 

Calculus also provides important tools in understanding functions and has led to the devel- 
opment of new areas of mathematics including real and complex analysis, topology, and non- 
euclidean geometry. 

Notwithstanding calculus’ functional utility (pun intended), many non-scientists and non- 
engineers have chosen to study calculus just for the challenge of doing so. A smaller number 
of persons undertake such a challenge and then discover that calculus is beautiful in and of it- 
self. 


2.3 What is involved in learning calculus? 


Learning calculus, like much of mathematics, involves two parts: 

• Understanding the concepts: You must be able to explain what it means when you take a 
derivative 4 rather than merely apply the formulas for finding a derivative. Otherwise, you 
will have no idea whether or not your solution is correct. Drawing diagrams, for example, can 
help clarify abstract concepts. 

• Symbolic manipulation: Like other branches of mathematics, calculus is written in symbols 
that represent concepts. You will learn what these symbols mean and how to use them. A good 
working knowledge of trigonometry 5 and algebra 6 is a must, especially in integral calculus. 
Sometimes you will need to manipulate expressions into a usable form before it is possible to 
perform operations in calculus. 


2.4 What you should know before using this text 


There are some basic skills that you need before you can use this text. Continuing with our 
example of a moving car: 


4 http://en.wikipedia.org/wiki/derivative 

5 http://en.wikibooks.org/wiki/trigonometry 

6 http://en.wikibooks.org/wiki/algebra 
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• You will need to describe the motion of the car in symbols. This involves understanding func- 
tions. 

• You need to manipulate these functions. This involves algebra. 

• You need to translate symbols into graphs and vice verse. This involves understanding the 
graphing of functions. 

• It also helps (although it isn’t necessarily essential) if you understand the functions used in 
trigonometry since these functions appear frequently in science. 


2.5 Scope 


The first four chapters of this textbook cover the topics taught in a typical high school or first year 
college course. The first chapter, ../Precalculus/ 7 , reviews those aspects of functions most es- 
sential to the mastery of Calculus, the second, ../Limits/ 8 , introduces the concept of the limit 
process. It also discusses some applications of limits and proposes using limits to examine slope 
and area of functions. The next two chapters, ../Differentiation / 9 and ../Integration/ 10 , 
apply limits to calculate derivatives and integrals. The Fundamental Theorem of Calculus is 
used, as are the essential formulae for computation of derivatives and integrals without resort- 
ing to the limit process. The third and fourth chapters include articles that apply the concepts 
previously learned to calculating volumes, and so on as well as other important formulae. 

The remainder of the central Calculus chapters cover topics taught in higher level Calculus top- 
ics: multivariable calculus, vectors, and series (Taylor, convergent, divergent). 

Finally, the other chapters cover the same material, using formal notation. They introduce the 
material at a much faster pace, and cover many more theorems than the other two sections. They 
assume knowledge of some set theory and set notation. 


7 http://en.wikibooks.org/wiki/ 

8 http://en.wikibooks.org/wiki/ 

9 http://en.wikibooks.org/wiki/ 

10 http://en.wikibooks.org/wiki/ 


%2FPrecalculus%2F 

%2FLimits%2F 

%2FDifferentiation%2F 

%2FIntegration%2F 
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4 Algebra 


This section is intended to review algebraic manipulation. It is important to understand algebra 
in order to do calculus. If you have a good knowledge of algebra, you should probably just skim 
this section to be sure you are familiar with the ideas. 


4.1 Rules of arithmetic and algebra 


The following rules are always true. 


4.1.1 Addition 

• Commutative Law: a+ b = b + a. 

• Associative Law: (a + b) + c = a + {b+ c) . 

• Additive Identity: a + 0 = a . 

• Additive Inverse: a + (-a) = 0 . 


4.1.2 Subtraction 

• Definition: a-b = a+[-b). 


4.1.3 Multiplication 

• Commutative law: ax b= bx a. 

• Associative law: (ax b) x c = ax (b x c) . 

• Multiplicative identity: a x 1 = a . 

• Multiplicative inverse: a x ^ = 1, whenever a ^ 0 

• Distributive law: ax (b + c) = (ax b) + (ax c) . 


4.1.4 Division 


• Definition: | = a x l , whenever b ^ 0 . 

The above laws are true for all a, b, and c, whether a, b, and c are numbers, variables, functions, 
or other expressions. For instance, 


(x+2) (x+3) 
x+3 


= [(x + 2) x (x + 3)] x 
= (x + 2) x [(x + 3) x ( 3 ^ 3 )] 
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= ((X + 2)X(1)), x^-3 

= x + 2, x^-3. 


Of course, the above is much longer than simply cancelling x + 3 out in both the numerator 
and denominator. But, when you are cancelling, you are really just doing the above steps, so it is 
important to know what the rules are so as to know when you are allowed to cancel. Occasionally 
people do the following, for instance, which is incorrect: 

2 x (x + 2) 2 x + 2 x + 2 

2 ~~ 2 X 2 ~~ 2 


The correct simplification is 


2 x (x + 2) 2 x + 2 x + 2 

= — X = 1 X 

2 2 1 1 


x + 2 


where the number 2 cancels out in both the numerator and the denominator. 


4.2 Interval notation 


There are a few different ways that one can express with symbols a specific interval (all the num- 
bers between two numbers). One way is with inequalities. If we wanted to denote the set of all 
numbers between, say, 2 and 4, we could write "all x satisfying 2<x<4." This excludes the end- 
points 2 and 4 because we use < instead of <. If we wanted to include the endpoints, we would 
write "all x satisfying 2 < x < 4." This includes the endpoints. 

Another way to write these intervals would be with interval notation. If we wished to convey 
"all x satisfying 2<x<4" we would write (2,4). This does not include the endpoints 2 and 4. If we 
wanted to include the endpoints we would write [2,4], If we wanted to include 2 and not 4 we 
would write [2,4) ; if we wanted to exclude 2 and include 4, we would write (2,4] . 

Thus, we have the following table: 

Endpoint conditions Inequality notation Interval notation 

Including both 2 and 4 all x satisfying 2 < x < 4 


[2,4] 


Not including 2 nor 4 


all x satisfying 2 < x < 4 


(2,4) 
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Endpoint conditions 

Inequality notation 

Interval notation 

Including 2 not 4 

all x satisfying 2 < x < 4 

[2,4) 

Including 4 not 2 

all x satisfying 2 < x < 4 

(2,4] 

general, we have the following table: 


Meaning 

Interval Notation 

Set Notation 

All values greater than or 
equal to a and less than or 
equal to b 

[a,b] 

{x : a < x < b} 

All values greater than a 
and less than b 

(a, b) 

{x:a< x<b} 

All values greater than or 
equal to a and less than b 

[ a,b ) 

{x: a< x< b} 

All values greater than a 
and less than or equal to b 

(a, b] 

{x : a < x < b] 

All values greater than or 
equal to a. 

[a,oo) 

{x : x> a} 

All values greater than a. 

[a, oo) 

{x : x > a} 

All values less than or equal 
to a. 

(— oo, a] 

{x:x< a] 

All values less than a. 

(— oo, a) 

{x : x < a] 

All values. 

(— 00 , 00 ) 

]x:xel) 


Note that oo and -oo must always have an exclusive parenthesis rather than an inclusive bracket. 
This is because oo is not a number, and therefore cannot be in our set. oo is really just a symbol 
that makes things easier to write, like the intervals above. 

The interval (a,b) is called an open interval, and the interval [a,b] is called a closed interval. 


4.3 Exponents and radicals 


There are a few rules and properties involving exponents and radicals that you’d do well to re- 
member. As a definition we have that if n is a positive integer then a n denotes n factors of a. That 
is, 
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a n = a-a-a---a [n times). 

If a ■£ 0 then we say that a° = 1 . If n is a positive integer we say that a~ n = If we have an 
exponent that’s a fraction we say that a m,n = \Z a m = ( \fa) m . 

These definitions yield the following table of properties: 


Rule 

Example 

a" ■ a m = a n+m 

3 6 -3 9 = 3 15 

a" _ „n-m 
a m 

^ = X 1 = X 

(a n ) m = a nm 

(X 4 ) 5 = X 20 

(. ab) n = a n b n 

(3x) 5 = 3 5 x ! 

/ \ n 


| a | _ a 

(z) - z! 

[ b ) ~~ b n 

UJ “33- 


4.4 Factoring and roots 

Given the expression x 2 + 3x + 2, one may ask "what are the values of x that make this expression 
0?" If we factor we obtain 


x 2 + 3x + 2 — (x + 2) (x + 1) . 

If X—- 1 or -2, then one of the factors on the right becomes zero. Therefore, the whole must be 
zero. So, by factoring we have discovered the values of x that render the expression zero. These 
values are termed "roots." In general, given a quadratic polynomial px 2 + qx + r that factors as 


px 2 + qx+r = {ax + c ) (bx + d) 

then we have that x = -c/a and x = -d/b are roots of the original polynomial. 

A special case to be on the look out for is the difference of two squares, a 2 - b 2 . In this case, we 
are always able to factor as 


a 2 -b 2 = [a + b){a- b). 

For example, consider 4x 2 - 9. On initial inspection we would see that both 4x 2 and 9 are squares 
((2x) 2 = 4x 2 and 3 2 = 9). Applying the previous rule we have 

4x 2 - 9 = (2x + 3) (2x - 3) . 
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4.5 Simplifying rational expressions 


Consider the two polynomials 


p[x) = a n x n + a n -\x n 1 h — + flix+flo 


and 


q{x) = b m x m + b m -\x m 1 h \-bix + b 0 . 

When we take the quotient of the two we obtain 


p(x) _ a n x n + a n - \X n 1 H — + a\X+ao 
q(x) b m x m + b m - — + b\x + bo' 

The ratio of two polynomials is called a rational expression. Many times we would like to sim- 
plify such a beast. For example, say we are given . We may simplify this in the following 
way: 


x 2 -l (x + 1) (x — 1) 

= = x - 1 

X + 1 x+ 1 

This is nice because we have obtained something we understand quite well, x- 1, from some- 
thing we didn’t. 


4.6 Formulas of multiplication of polynomials 


Here are some formulas that can be quite useful for solving polynomial problems: 


{a + b) 2 = a 2 + 2ab + b 2 
( a - b) 2 = a 2 - 2 ab + b 2 
C a-b){a+b ) = a 2 -b 2 
( a ± b ) 3 = a 3 ± 3a 2 b + 3ab 2 ± b 3 
a 3 ±b 3 = (a ± b) ( a 2 + ab + b 2 ) 
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5 Functions 


5.1 Classical understanding of functions 


To provide the classical understanding of functions, think of a function as a kind of machine. 
You feed the machine raw materials, and the machine changes the raw materials into a finished 
product based on a specific set of instructions. The kinds of functions we consider here, for 
the most part, take in a real number, change it in a formulaic way, and give out a real number 
(possibly the same as the one it took in). Think of this as an input-output machine, you give the 
function an input, and it gives you an output. For example, the squaring function takes the input 
4 and gives the output value 16. The same squaring function takes the input -1 and gives the 
output value 1. 

A function is usually written as /, g, or something similar - although it doesn’t have to be. A 
function is always defined as "of a variable" which tells us what to replace in the formula for the 
function. 

For example, fix ) = 3x + 2 tells us: 

• The function / is a function of x. 

• To evaluate the function at a certain number, replace the x with that number. 

• Replacing x with that number in the right side of the function will produce the function’s out- 
put for that certain input. 

• In English, the definition of / is interpreted, "Given a number, / will return two more than the 
triple of that number . " 

Thus, if we want to know the value (or output) of the function at 3: 


fix ) = 3x + 2 


We evaluate the function at 


/(3) = 3(3) + 2 
x = 3 


/(3) =9 + 2= 11 

The value of 

/ 

at 3 is 1 1 . 
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See? It’s easy! 

Note that /( 3) means the value of the dependent variable when x takes on the value of 3. So 
we see that the number 1 1 is the output of the function when we give the number 3 as the input. 
We refer to the input as the argument of the function (or the independent variable) , and to 
the output as the value of the function at the given argument (or the dependent variable) . A 
good way to think of it is the dependent variable fix ) ’depends’ on the value of the independent 
variable x . This is read as "the value of / at three is eleven", or simply "/ of three equals eleven". 


5.2 Notation 

Functions are used so much that there is a special notation for them. The notation is somewhat 
ambiguous, so familiarity with it is important in order to understand the intention of an equation 
or formula. 

Though there are no strict rules for naming a function, it is standard practice to use the letters /, 
g, and h to denote functions, and the variable x to denote an independent variable, y is used for 
both dependent and independent variables. 

When discussing or working with a function /, it’s important to know not only the function, but 
also its independent variable x. Thus, when referring to a function /, you usually do not write 
/, but instead fix). The function is now referred to as "/ of x". The name of the function is 
adjacent to the independent variable (in parentheses). This is useful for indicating the value of 
the function at a particular value of the independent variable. For instance, if 


fix) = 7x+ 1 


and if we want to use the value of / for x equal to 2, then we would substitute 2 for x on both 
sides of the definition above and write 


/(2) = 7(2) + 1 = 14+ 1 = 15 

This notation is more informative than leaving off the independent variable and writing simply 
’/’, but can be ambiguous since the parentheses can be misinterpreted as multiplication. 


5.3 Modern understanding of functions 


The formal definition of a function states that a function is actually a rule that associates ele- 
ments of one set called the domain of the function, with the elements of another set called the 
range of the function. For each value we select from the domain of the function, there exists ex- 
actly one corresponding element in the range of the function. The definition of the function tells 
us which element in the range corresponds to the element we picked from the domain. Classi- 
cally, the element picked from the domain is pictured as something that is fed into the function 
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and the corresponding element in the range is pictured as the output. Since we "pick" the ele- 
ment in the domain whose corresponding element in the range we want to find, we have control 
over what element we pick and hence this element is also known as the "independent variable". 
The element mapped in the range is beyond our control and is "mapped to" by the function. This 
element is hence also known as the "dependent variable", for it depends on which independent 
variable we pick. Since the elementary idea of functions is better understood from the classi- 
cal viewpoint, we shall use it hereafter. However, it is still important to remember the correct 
definition of functions at all times. 

To make it simple, for the function fix), all of the possible x values constitute the domain, and 
all of the values fix) iy on the x-y plane) constitute the range. 


5.4 Remarks 

The following arise as a direct consequence of the definition of functions: 

1. By definition, for each "input" a function returns only one "output", corresponding to that 
input. While the same output may correspond to more than one input, one input cannot 
correspond to more than one output. This is expressed graphically as the vertical line test 
a line drawn parallel to the axis of the dependent variable (normally vertical) will inter- 
sect the graph of a function only once. However, a line drawn parallel to the axis of the 
independent variable (normally horizontal) may intersect the graph of a function as many 
times as it likes. Equivalently, this has an algebraic (or formula-based) interpretation. We 
can always say if a = b, then fia) = fib), but if we only know that fia) = fib) then we can’t 
be sure that a = b. 

2. Each function has a set of values, the function’s domain, which it can accept as input. 
Perhaps this set is all positive real numbers; perhaps it is the set {pork, mutton, beef}. This 
set must be implicitly/ explicitly defined in the definition of the function. You cannot feed 
the function an element that isn’t in the domain, as the function is not defined for that 
input element. 

3. Each function has a set of values, the function’s range, which it can output. This may be 
the set of real numbers, ft may be the set of positive integers or even the set {0,1}. This set, 
too, must be implicitly/ explicitly defined in the definition of the function. 
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5.5 The vertical line test 

The vertical line test, mentioned in the preceding paragraph, is a systematic test to find out if an 
equation involving x and y can serve as a function (with x the independent variable and y the 
dependent variable). Simply graph the equation and draw a vertical line through each point of 
the x-axis. If any vertical line ever touches the graph at more than one point, then the equation 
is not a function; if the line always touches at most one point of the graph, then the equation is a 
function. 
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(There are a lot of useful curves, like circles, that aren’t functions (see picture). Some people call 
these graphs with multiple intercepts, like our circle, "multi-valued functions"; they would refer 
to our "functions" as "single-valued functions".) 


5.6 Important functions 


fix ) = c It disregards the 
input and always outputs 
the constant c, and is a 
polynomial of the zeroth 
degree where fix) = cx°= 
c(l) = c. Its graph is a hori- 
zontal line. 

fix) = mx + c Takes an 
input, multiplies by m and 
adds c. It is a polynomial of 
the first degree. Its graph 
is a line (slanted, except 
m = 0). 

fix) = x Takes an input 
and outputs it unchanged. 
A polynomial of the first 
degree, fix) = x 1 = x. Special 
case of a linear function. 

Quadratic function fix) = ax 2 + bx + c A 

polynomial of the sec- 
ond degree. Its graph is 
a parabola, unless a = 0. 
(Don’t worry if you don’t 
know what this is.) 

Polynomial function fix) = a n x n + a n -ix n ~ l + 

— h a2 x 2 + a\x + flo The 
number n is called the de- 
gree. 

[-1 : x < 0 

Signum function sgn(x) = < 0 : x = 0 

[ 1 : x > 0. 

Determines the sign of the 
argument x. 


Constant function 


Linear function 


Identity function 


5.7 Example functions 


Some more simple examples of functions have been listed below. 
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h{x) - 



if x > 0 
if x < 0 


Gives f if input is positive, - f if input is negative. Note that the function only accepts neg- 
ative and positive numbers, not 

0 


. Mathematics describes this condition by saying 


0 


is not in the domain of the function. 

g(y) - y 2 

Takes an input and squares it. 

g[z) = z 2 

Exactly the same function, rewritten with a different independent variable. This is per- 
fectly legal and sometimes done to prevent confusion (e.g. when there are already too 
many uses of 

x 

or 

y 

in the same paragraph.) 



Note that we can define a function by a totally arbitrary rule. Such functions are called 
piecewise functions. 


It is possible to replace the independent variable with any mathematical expression, not just a 
number. For instance, if the independent variable is itself a function of another variable, then it 
could be replaced with that function. This is called composition, and is discussed later. 


5.8 Manipulating functions 


Functions can be manipulated in the same ways as variables; they can be added, multiplied, 
raised to powers, etc. For instance, let 


and 


/(x) = 3x + 2 


g(x) = x 2 
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Then 


f + g=if + g)ix) 

= fM + g(x) 

= (3x + 2) + (x 2 ) 
— x + 3x + 2 


f-g=if~g)(x) 

= fix) - g(x) 

= (3x + 2) — (x 2 ) 
= — x 2 + 3x + 2 


f x g = if x g)ix) 

= fix) X g(x) 

= (3x + 2) x (x 2 ) 
= 3x 3 + 2x 2 


/ 

g 


T 

. g . 


(x) 


fix) 
gix) 
3x + 2 


3 2 

“ + ~2 
X x z 


5.8. 1 Composition of functions 

However, there is one particular way to combine functions which cannot be done with variables. 
The value of a function / depends upon the value of another variable x; however, that variable 
could be equal to another function g, so its value depends on the value of a third variable. If this 
is the case, then the first variable is a function h of the third variable; this function (h) is called 
the composition of the other two functions (/ and g). Composition is denoted by 


f ° g = if ° g)ix) = figix)) 
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This can be read as either "f composed with g" or "f of g ofx." 
For instance, let 


fix ) = 3x + 2 

and 

g(x) = x 2 

Then 


h[x) = figix)) 

= fix 2 ) 

= 3(x 2 ) +2 
= 3x 2 + 2 

Here, h is the composition of / and g and we write h = / ° g. Note that composition is not 
commutative: 


, and 


figix)) = 3x 2 + 2 


gifix)) = gi 3x + 2) 

= (3x + 2) 2 
= 9x 2 + 12x + 4 

/(gM) ^ g(/(x)) 


Composition of functions is very common, mainly because functions themselves are common. 
For instance, squaring and sine are both functions: 


square (x) = x 2 


sine(x) = sinx 

Thus, the expression sin 2 x is a composition of functions: 

{| 
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|- |sin 2 x | = square(sinx) |- 1 1 = square(sine(x)). |} 

(Note that this is not the same as sine(square(x)) = sinx 2 .) Since the function sine equals 1/2 if 
x = nl6, 


square (sine {nl 6)) = square(l/2) 

Since the function square equals l/4ifx=l/2, 

sin 2 7r/6 = square(sine(7r/6)) = square(l/2) = 1/4 


5.8.2 Transformations 

Transformations are a type of function manipulation that are very common. They consist of 
multiplying, dividing, adding or subtracting constants to either the input or the output. Multi- 
plying by a constant is called dilation and adding a constant is called translation. Here are a few 
examples: 


/( 2 x x) 

Dilation 

fix + 2) 

Translation 

2 x /(x) 

Dilation 

2 + /(x) 

Translation 
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Figure 8: Examples of horizontal and vertical translations 



Figure 9: Examples of horizontal and vertical dilations 


Translations and dilations can be either horizontal or vertical. Examples of both vertical and 
horizontal translations can be seen at right. The red graphs represent functions in their ’original’ 
state, the solid blue graphs have been translated (shifted) horizontally, and the dashed graphs 
have been translated vertically. 

Dilations are demonstrated in a similar fashion. The function 


/( 2 x jc) 

has had its input doubled. One way to think about this is that now any change in the input will 
be doubled. If I add one to x, I add two to the input of /, so it will now change twice as quickly. 
Thus, this is a horizontal dilation by \ because the distance to the y-axis has been halved. A 
vertical dilation, such as 
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2 x fix) 

is slightly more straightforward. In this case, you double the output of the function. The output 
represents the distance from the x-axis, so in effect, you have made the graph of the function 
’taller’. Here are a few basic examples where a is any positive constant: 


Original graph 

fix) 

Rotation about ori- 

-fi-x) 



gin 


Horizontal trans- 

fix + a) 

Horizontal trans- 

fix- a) 

lation by a units 


lation by a units 


left 


right 


Horizontal dilation 

fi xx ^ 

Vertical dilation by 

a x fix ) 

by a factor of a 


a factor of a 


Vertical translation 

fix ) - a 

Vertical translation 

fix) + a 

by a units down 


by a units up 


Reflection about 

-fM 

Reflection about 

fi-x) 

x-axis 


y-axis 
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5.9 Domain and Range 

5.9.1 Domain 



I + 1 

I I 

I I I 

Figure 10: The domain of the function is the interval from -1 to 1 

The domain of a function is the set of all points over which it is defined. More simply, it repre- 
sents the set of x- values which the function can accept as input. For instance, if 

fix) = V l -x 2 
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then fix) is only defined for values of x between -1 and 1, because the square root function 
is not debned (in real numbers) for negative values. Thus, the domain, in interval notation, is 
[-1,1]. In other words, 


fix) is defined for x e [-1, l],or{x : -1 < x < 1} 



Figure 1 1 : The range of the function is the interval from 0 to 1 


5.9.2 Range 

The range of a function is the set of all values which it attains (i.e. the y- values). For instance, if: 


37 


Functions 


fix) = Vl-X 2 


then f{x) can only equal values in the interval from 0 to 1. Thus, the range of / is [0, 1] . 


5.9.3 One-to-one Functions 

A function fix) is one-to-one (or less commonly injective) if, for every value of /, there is only 
one value of x that corresponds to that value of /. For instance, the function fix) = Vl- x 2 is not 
one-to-one, because both x = 1 and x = -1 result in fix) = 0. However, the function fix) = x + 2 
is one-to-one, because, for every possible value of fix), there is exactly one corresponding value 
of x. Other examples of one-to-one functions are fix) = x 3 + ax, where a e [0,oo). Note that if 
you have a one-to-one function and translate or dilate it, it remains one-to-one. (Of course you 
can’t multiply x or / by a zero factor) . 


Horizontal Line Test 

If you know what the graph of a function looks like, it is easy to determine whether or not the 
function is one-to-one. If every horizontal line intersects the graph in at most one point, then 
the function is one-to-one. This is known as the Horizontal Line Test. 


Algebraic 1-1 Test 

If you don’t know what the graph of the function looks like, it is also easy to determine whether 
or not the function is one-to-one. The rule fia) = fib) applies. 


e.g. Is fix) = a 1-1 function? 


fia ) = fib) 


1 - 2 a _ 1 - 2 b 
1 + a 1 + b 


(1 + b)il-2a) = (1 + a)il-2b) 


38 



Domain and Range 


l-2a + b-2ab= \-2b + a-2ab 
1-2 a + b= 1-2 b + a 
1 — 2ci 3b — 1 + rz 
1 + 3b = 1 + 3*2 
a = b 

Therefore by the algebraic 1-1 test, the function fix) is 1-1 

5.9.4 Inverse functions 

We call g(x) the inverse function of fix) if, for all x: 

g(/(x)) = /(g(x)) = x 

A function fix) has an inverse function if and only if fix) is one-to-one. For example, the inverse 
of fix) = x + 2 is g(x) = x - 2. The function fix) = Vl-x 2 has no inverse. 
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Notation 

The inverse function of / is denoted as / _1 (x). Thus, / _1 (x) is defined as the function that 
follows this rule 

/(/ _1 (x)) = / _1 (/(x)) = x: 

To determine / -1 (x) when given a function /, substitute / -1 (x) for x and substitute x for fix). 
Then solve for f~ l (x), provided that it is also a function. 

Example: Given fix ) = 2x- 7, find / _1 (x). 

Substitute / _1 (x) for x and substitute x for fix). Then solve for f~ l (x): 


fix) = 2x-7 


x = 2[/ _1 (x)] - 7 


x + 7 = 2[/ -1 (x)] 


To check your work, confirm that / 1 (/(x)) = x: 

/ _1 (/(x)) = 

/“ 1 (2x-7) = 

2x-7+7 _ 2x _ r 
2 _ 2 _ X 

If / isn’t one-to-one, then, as we said before, it doesn’t have an inverse. Then this method will 
fail. 

Example: Given fix) = x 2 , find/ 1 (x). 

Substitute / -1 (x) for x and substitute x for fix). Then solve for / -1 (x): 


fix) = x 2 


x=if~ 1 ix)f 


f V) = +v/x 

Since there are two possibilities for / -1 (x), it’s not a function. Thus fix) = x 2 doesn’t have an 
inverse. Of course, we could also have found this out from the graph by applying the Horizontal 
Line Test. It’s useful, though, to have lots of ways to solve a problem, since in a specific case some 
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of them might be very difficult while others might be easy. For example, we might only know an 
algebraic expression for fix) but not a graph. 

nliAnalyse/Functies 1 


1 http://nl.wikibooks.org/wiki/Analyse%2FFuncties 
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6 Graphing linear functions 



Figure 12: Graph ofy=2x 


It is sometimes difficult to understand the behavior of a function given only its definition; a 
visual representation or graph can be very helpful. A graph is a set of points in the Cartesian 
plane, where each point (x,y) indicates that fix) = y. In other words, a graph uses the position 
of a point in one direction (the vertical-axis or y-axis ) to indicate the value of / for a position of 
the point in the other direction (the horizontal-axis or x-axis ). 

Functions may be graphed by finding the value of / for various x and plotting the points (x, fix)) 
in a Cartesian plane. For the functions that you will deal with, the parts of the function between 
the points can generally be approximated by drawing a line or curve between the points. Extend- 
ing the function beyond the set of points is also possible, but becomes increasingly inaccurate. 
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6.1 Example 


Plotting points like this is laborious. Fortunately, many functions’ graphs fall into general pat- 
terns. For a simple case, consider functions of the form 


fix) = 3x + 2 

The graph of / is a single line, passing through the point (0,2) with slope 3. Thus, after plotting 
the point, a straightedge may be used to draw the graph. This type of function is called linear 
and there are a few different ways to present a function of this type. 


6.2 Slope-intercept form 


When we see a function presented as 


y = mx + b 

we call this presentation the slope-intercept form. This is because, not surprisingly, this way of 
writing a linear function involves the slope, m, and the y-intercept, b. 

x+y=7 y-x=7 


6.3 Point- slope form 


If someone walks up to you and gives you one point and a slope, you can draw one line and only 
one line that goes through that point and has that slope. Said differently, a point and a slope 
uniquely determine a line. So, if given a point (xq, yo) and a slope in, we present the graph as 


y-y 0 = m(x-xo). 


We call this presentation the point-slope form. The point-slope and slope-intercept form are 
essentially the same. In the point-slope form we can use any point the graph passes through. 
Where as, in the slope-intercept form, we use the y- intercept, that is the point (0 ,b). 


6.4 Calculating slope 


If given two points, (xi,yi) and Uy, jy), we may then compute the slope of the line that passes 
through these two points. Remember, the slope is determined as "rise over run." That is, the 
slope is the change in y- values divided by the change in x- values. In symbols, 
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, change in y Ay 

slope = — = — . 

change in x Ax 


So now the question is, "what’s Ay and Ax?" We have that Ay = y 2 - yi and Ax = X 2 - Xi. Thus, 


slope = 


T2-yi 
x 2 - Xi ' 


6.5 Two-point form 

Two points also uniquely determine a line. Given points (xi,yi) and (X 2 ,y 2 ), we have the equa- 
tion 


n-y\, , 

y-yi = (x-xi). 

x 2 -xi 


This presentation is in the two-point form. It is essentially the same as the point-slope form 
except we substitute the expression for m. 

nl^nalyse/Functies 1 


6.6 Exercises 

6.7 Algebra 

6.7. 1 Convert to interval notation 

1. -4 < x < 2 

2. -7/3 < x < -1/3 

3. -n<x<n 

4. x < 17/9 

5. 5 < x+ 1 < 6 

6. x— 1/4 < 1 

7. 3 > 3x 

8. 0 < 2x + 1 < 3 

9. 5 < x and x < 6 

10. 5 < x or x< 6 


1 http://nl.wikibooks.org/wiki/Analyse%2FFuncties 
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6.7.2 State the following intervals using inequalities 

1. [3,4] 

2. [3,4) 

3. (3,oo) 

4. (-1/3, 1/3) 

5. (-7T, 15/16) 

6. (— oo,oo) 


6.7.3 Which one of the following is a true statement? 

Hint: the true statement is often referred to as the triangle inequality. Give examples where the 
other two are false. 

1. |x + y| = |x| + |y| 

2. |x + y| > |x| + |y| 

3. |x + y| < |x| + |y| 


6.7.4 Evaluate the following expressions 


1. 8 1/3 

2 . (— 8) 1/3 

r , \ 1/3 


4. 

5. 


( 8 2 / 3 )( 8 3/2 


( l 

A 8 


l/3\7 


) ( 8 °) 


6 . 

7. 

8 . 

9. 


3/27 

V 8 

4 5 - 4- 2 

4 3 



\/27 

^9 


6.7.5 Simplify the following 


1. x 3 + 3x 3 

2 x 3 +3x 3 

3. (x 3 + 3x 3 ) 3 


4. 

5. 

6 . 


x 15 +x 3 

X 


(2x 2 )(3x“ 2 ) 

x 2 y 3 


7. sjx 2 y 4 
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6.7.6 Factor the following expressions 

For 1-8, determine what values of x make the expression 0 (i.e. determine the roots). 

1. x 2 — 1 

2. x 2 + 2x + 1 

3. x 2 + 7x + 12 

4. 3x 2 -5x-2 

5. x 2 + 5/6x+ 1/6 

6. 4x 3 + 4x 2 + x 

7. x 4 — 1 

8. x 3 + 2x 2 — 4x — 8 

9. 4a 2 - ab-3b 2 

10. (c + d) 2 - 4 

11. 4x 2 - 9y 2 


6.7.7 Simplify the following 


o 3 x 2 +4x+1 
x+1 

o 4 x 2 -9 
4x 2 + 12x+9 
. x 2 +y 2 +2xy 

' x(x+y) 


6.8 Functions 


1. Let/(x) = x 2 . 

a) Compute /( 0) and /( 2). 

b) What are the domain and range of /? 

c) Does / have an inverse? If so, find a formula for it. 

2. Let/(x) = x + 2, g(x) = 1/x. 

a) Give formulae for 

i- f + g, 

ii- f~g, 

iii- g~f, 
iv. /xg, 
v- fig, 

vi. gif, 

vii. fog and 

viii. g°/. 

b) Compute /(g(2)) and g(/(2)). 

c) Do / and g have inverses? If so, find formulae for them. 

3. Does this graph represent a function? 


47 



Graphing linear functions 



4. Consider the following function 

if x < - 1 
if - 1 < x < 0 
if x > 0. 

a) What is the domain? 

b) What is the range? 

c) Where is / continuous? 

5. Consider the following function 


a) What is the domain? 

b) What is the range? 

c) Where is / continuous? 

6. Consider the following function 


fix) = 


x 2 if x > 0 
-1 if x < 0. 


fix) = < 


_i 

9 

2 

x + 3 
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a) What is the domain? 

b) What is the range? 

c) Where is / continuous? 

7. Consider the following function 


fix) = 


\j2x-3 
x- 10 


a) What is the domain? 

b) What is the range? 

c) Where is / continuous? 


fix) = 


x-7 
x 2 — 49 


6.8.1 Decomposition of functions 


For each of the following functions, h, find functions / and g such that (/(g(x)) = h(x) 


1 . 

2 . 

3. 


h{x) = x 3 + 4x + 5 + V x 2 + 3x + 2 
h(x) = 


(x+l ) 2 


h{x) = \ \ +3 


6.9 Graphing 

1. Find the equation of the line that passes through the point (1,-1) and has slope 3. 

2. Find the equation of the line that passes through the origin and the point (2,3). 

Solutions 2 


2 Chapter 62.0.2 on page 407 
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7 Limits 


7.1 Intuitive Look 


A limit looks at what happens to a function when the input approaches a certain value. The 
general notation for a limit is as follows: 


lim fix) 

x^a 

This is read as "The limit of / of x as x approaches a". We’ll take up later the question of how we 
can determine whether a limit exists for fix) at a and, if so, what it is. For now, we’ll look at it 
from an intuitive standpoint. 

Let’s say that the function that we’re interested in is /(x) = x 2 , and that we’re interested in its 
limit as x approaches 2. Using the above notation, we can write the limit that we’re interested in 
as follows: 


lim x 2 

x^2 

One way to try to evaluate what this limit is would be to choose values near 2, compute fix ) for 
each, and see what happens as they get closer to 2. This is implemented as follows: 

X 1.7 1.8 1.9 1.95 1.99 1.999 

fix) = 2.89 3.24 3.61 3.8025 3.9601 3.996001 


Here we chose numbers smaller than 2, and approached 2 from below. We can also choose num- 
bers larger than 2, and approach 2 from above: 

x 2.3 2.2 2.1 2.05 2.01 2.001 

fix) = 5.29 4.84 4.41 4.2025 4.0401 4.004001 


We can see from the tables that as x grows closer and closer to 2, fix) seems to get closer and 
closer to 4, regardless of whether x approaches 2 from above or from below. For this reason, we 
feel reasonably confident that the limit of x 2 as x approaches 2 is 4, or, written in limit notation, 


lim x 2 = 4. 

x^2 
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Now let’s look at another example. Suppose we’re interested in the behavior of the function 
fix ) = ^2 as x approaches 2. Here’s the limit in limit notation: 


lim 

x—>2 X - 2 

Just as before, we can compute function values as x approaches 2 from below and from above. 
Here’s a table, approaching from below: 


x 1.7 1.8 

fix ) = -3.333 -5 

l 

x-2 


1.9 1.95 1.99 1.999 

-10 -20 -100 -1000 


And here from above: 

x 2.3 2.2 2.1 

fix) = 3.333 5 10 

l 

x-2 


2.05 2.01 2.001 

20 100 1000 


In this case, the function doesn’t seem to be approaching any value as x approaches 2. In this 
case we would say that the limit doesn’t exist. 

Both of these examples may seem trivial, but consider the following function: 


fix ) = 


x 2 (x - 2) 
x-2 


This function is the same as 


f x 2 if x 1 2 

X {undefined ifx = 2 

Note that these functions are really completely identical; not just "almost the same," but actually, 
in terms of the definition of a function, completely the same; they give exactly the same output 
for every input. 

In algebra, we would simply say that we can cancel the term (x-2), and then we have the function 
fix) = x 2 . This, however, would be a bit dishonest; the function that we have now is not really the 
same as the one we started with, because it is defined when x = 2, and our original function was 
specifically not defined when x = 2. fn algebra we were willing to ignore this difficulty because 
we had no better way of dealing with this type of function. Now, however, in calculus, we can 
introduce a better, more correct way of looking at this type of function. What we want is to be 
able to say that, although the function doesn’t exist when x = 2, it works almost as though it 
does. It may not get there, but it gets really, really close. That is, /( 1.99999) = 3.99996. The only 
question that we have is: what do we mean by "close"? 
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7.2 Informal definition of a limit 

As the precise definition of a limit is a bit technical, it is easier to start with an informal definition; 
we’ll explain the formal definition later. 

We suppose that a function / is defined for x near c (but we do not require that it be defined 
when x = c). 

Definition: (Informal definition of a limit) 

We call L the limit of /(x) as x approaches c if /(x) becomes close to L when x is close (but not 
equal) to c. 

When this holds we write 

lim fix) = L 
or 

f (x) — * L as x — *■ c. 


Notice that the definition of a limit is not concerned with the value of /(x) when x = c (which 
may exist or may not). All we care about are the values of /(x) when x is close to c, on either the 
left or the right (i.e. less or greater). 


7.3 Limit rules 

Now that we have defined, informally, what a limit is, we will list some rules that are useful for 
working with and computing limits. You will be able to prove all these once we formally define 
the fundamental concept of the limit of a function. 

First, the constant rule states that if fix) = b (that is, / is constant for all x) then the limit as x 
approaches c must be equal to b. In other words 

{{Calculus/Def|text= Constant Rule for Limits 


If b and c are constants then 


•}} 


lim b = b 

X— *-C 


Second, the identity rule states that if fix) = x (that is, / just gives back whatever number you 
put in) then the limit of / as x approaches c is equal to c. That is, 

{{Calculus/Def|text= Identity Rule for Limits 
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If c is a constant then 


•}} 


lim x = c 


The next few rules tell us how, given the values of some limits, to compute others. 

Operational Identities for Limits 


Suppose that lim X ^ c f[x) = L and lim , 
• lim x ^ c kf(x) = /c- lirn,_ r /(x) = Id 


- g(x) = M and that k is constant. Then 


lim x ^ c [/(x) + g(x)] = hm x ^ c /(x) + lim x ^ c g(x) = L + M 
lim x ^ c [/(x)-g(x)] =lim x ^ c /(x) -lim x ^ c g(x) = L-M 
lim x ^ c [/(x)g(x)] = lim x ^ c /(x)lim x ^ c g(x) = LM 

provided M ^ 0 


li m ZM - _ A 


g(x) lim x_ c g(x) 


Notice that in the last rule we need to require that M is not equal to zero (otherwise we would be 
dividing by zero which is an undefined operation). 

These rules are known as identities; they are the scalar product, sum, difference, product, and 
quotient rules for limits. (A scalar is a constant, and, when you multiply a function by a constant, 
we say that you are performing scalar multiplication.) 

Using these rules we can deduce another. Namely, using the rule for products many times we get 
that 


for a positive integer 


lim/(x)” - |litn/(x)j” 


n 


This is called the power rule. 


7.3.1 Examples 
Example 1 

Find the limit lim x _2 4x 3 . 

We need to simplify the problem, since we have no rules about this expression by itself. We know 
from the identity rule above that lim c _2 x = 2. By the power rule, lim x _2 x 3 = (lim x _2 x) :i = 2 3 = 
8. Lastly, by the scalar multiplication rule, we get lim x _ 2 4x 3 = 4 lim c _ 2 x 3 = 4 • 8 = 32. 

Example 2 

Find the limit lim x _2 |4x 3 + 5x + 7] . 

To do this informally, we split up the expression, once again, into its components. As 
above, lim x ^2 4x 3 = 32. 
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Also lim x _ 2 5x = 5 • lim x _ 2 x = 5 • 2 = 10 and lim x _ 2 7 = 7. Adding these together gives 
lim 4x 3 + 5x + 7 = lim 4x 3 + lim 5x + lim 7 = 32 + 10 + 7 = 49 

x— 2 x— 2 x— 2 x— 2 


Example 3 

Find the limit lim x ^ 2 ^Kx+ioy 

From the previous example the limit of the numerator is lim x _2 4x 3 + 5x+ 7 = 49. The limit of the 
denominator is 


lim(x-4)(x+ 10) = lim(x-4) • lim(x + 10) = (2-4) • (2 + 10) = -24. 

x — *1 x — *1 x — >2 

As the limit of the denominator is not equal to zero we can divide. This gives 


lim 


4x + 5x + 7 


49 


x-2(x-4)(x+10) 24 


Example 4 

Find the limit lim x ^ 4 * 4 ~ 3( ^i +7 . 

We apply the same process here as we did in the previous set of examples; 


, x 4 -16x + 7 lim ( _ 4 {x 4 - 16x + 7) lim x _ 4 (x) - lim x _ 4 (16x) + lim x _ 4 (7) 

inn = — = 

x— 4 4x-5 hm x _ 4 (4x - 5) lim x _ 4 (4x) - hm x _ 4 5 


We can evaluate each of these; lim x ^ 4 (x 4 ) = 256, lim x _ 4 (16x) = 64, lim x _ 4 (7) = 7, lim x _ 4 (4x) = 
16 and lim x— 4 (5) = 5. Thus, the answer is 

Example 5 

Find the limit lim x ^ 0 x >sx . 

To evaluate this seemingly complex limit, we will need to recall some sine and cosine identities. 
We will also have to use two new facts. First, if fix ) is a trigonometric function (that is, one of 
sine, cosine, tangent, cotangent, secant or cosecant) and is defined at a, then lim x ^ fl fix) = fia). 
Second, lim x ^o = 1. 

To evaluate the limit, recognize that 1 - cos x can be multiplied by 1 + cos x to obtain (1 - cos 2 x) 
which, by our trig identities, is sin 2 x. So, multiply the top and bottom by 1 + cosx. (This is 
allowed because it is identical to multiplying by one.) This is a standard trick for evaluating 
limits of fractions; multiply the numerator and the denominator by a carefully chosen expression 
which will make the expression simplify somehow. In this case, we should end up with: 
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1-cosx 
lim 

x^O X 


lim 

x^O 


1-cosx 11 


lim 


x 1 

, /"1-cosx 1 + cosx 

lim 

x^O\ X 1 + cosx 

(1 - cos x) • 1 + (1 - cos x) • COS X 


lim 

x^O 


x • (1 + cos x) 

1 -cosx + cosx- 


COS 2 X 


x- (1 + cos x) 

lim 


cos 2 x 


x^Ox- (1 + cosx) 
sin 2 x 

lim 

X-Ox - (1 + cosx) 
/ 


lim 

x^O 


sinx 


smx 
1 + cos x 


Our next step should be to break this up into lim_ t _o • lim x _o by the product rule. As 

mentioned above, lim ,_ 0 = 1. 


Next, lim 


sinx _ lim x ^ 0 sinx _ 0 

x ^° 1+cosx lim v _ 0 (l+cosx) — 1+cosO 


= 0 . 


Thus, by multiplying these two results, we obtain 0. 


We will now present an amazingly useful result, even though we cannot prove it yet. We can find 
the limit at c of any polynomial or rational function, as long as that rational function is defined 
at c (so we are not dividing by zero) . That is, c must be in the domain of the function. 

{{Calculus/Def |title=Limits of Polynomials and Rational functions |text= If / is a polynomial or 
rational function that is defined at c then 


lim /(x) = /(c) 

X-*C 

</center>}} 

We already learned this for trigonometric functions, so we see that it is easy to find limits of 
polynomial, rational or trigonometric functions wherever they are defined. In fact, this is true 
even for combinations of these functions; thus, for example, lim ( _i (sinx 2 + 4cos 3 (3x- 1)) = 
sinl 2 +4 cos 3 (3(1) - 1). 
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7.3.2 The Squeeze Theorem 



Figure 14: Graph showing / being squeezed between g and h 


The Squeeze Theorem is very important in calculus, where it is typically used to find the limit of 
a function by comparison with two other functions whose limits are known. 

It is called the Squeeze Theorem because it refers to a function / whose values are squeezed 
between the values of two other functions g and h, both of which have the same limit L. If the 
value of / is trapped between the values of the two functions g and h, the values of / must also 
approach L. 

Expressed more precisely: 

Theorem: (Squeeze Theorem) 

Suppose that g(x) < /(x) < h{x) holds for all x in some open interval containing a, except possi- 
bly at x = a itself. Suppose also that lim x _„ g(x) = h(x) = L. Then lim x _ fl /(x) = L also. 
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Figure 15: Plot ofx*sin(l/x) for -0.5 < x<0.5 


Example: Compute lim x ^o xsin(l/x). Note that the sine of anything is in the interval [-1,1]. 
That is, - 1 < sin x < 1 for all x. If x is positive, we can multiply these inequalities by x and get 
-x < xsin(l/x) < x. If x is negative, we can similarly multiply the inequalities by the positive 
number -x and get x < xsin(l/x) < -x. Putting these together, we can see that, for all nonzero 
x, - |x| < xsin(l/x) < |x|. But it’s easy to see that lim x ^o - 1*1 = lim x ^o \x\ = 0. So, by the Squeeze 
Theorem, lim x ^oXsin(l/x) = 0. 


7.4 Finding limits 


Now, we will discuss how, in practice, to find limits. First, if the function can be built out of ratio- 
nal, trigonometric, logarithmic and exponential functions, then if a number c is in the domain 
of the function, then the limit at c is simply the value of the function at c. 

If c is not in the domain of the function, then in many cases (as with rational functions) the 
domain of the function includes all the points near c, but not c itself. An example would be if we 
wanted to find lim x _o y, where the domain includes all numbers besides 0. 

In that case, in order to find lim,-_ f /(x) we want to find a function g(x) similar to /(x), except 
with the hole at c filled in. The limits of / and g will be the same, as can be seen from the 
definition of a limit. By definition, the limit depends on fix) only at the points where x is close to 
c but not equal to it, so the limit at c does not depend on the value of the function at c. Therefore, 
if lim x ^ c g(x) = L, lim X ^ c /(x) = L also. And since the domain of our new function g includes c, 
we can now (assuming g is still built out of rational, trigonometric, logarithmic and exponential 
functions) just evaluate it at c as before. Thus we have lim x _ f: /(x) = g(c). 

In our example, this is easy; canceling the x’s gives g(x) = 1, which equals fix ) = xlx at all points 
except 0. Thus, we have lim x _o | = lim x _o 1 = 1. In general, when computing limits of rational 
functions, it’s a good idea to look for common factors in the numerator and denominator. 

Lastly, note that the limit might not exist at all. There are a number of ways in which this can 
occur: 
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9 
8 
7 
6 
5 
4 
3 
2 
1 
0 

- 8 - 6 - 4-2 0 2 4 6 8 

Figure 16: fix) = V x 2 - 16 



"Gap": There is a gap (not just a single point) where the function is not defined. As an example, 
in 


f{x) = \/x 2 - 16 


does not exist when 


lim/(x) 


—4 < c < 4 


. There is no way to "approach" the middle of the graph. Note that the function also 
has no limit at the endpoints of the two curves generated (at 


c = -4 


). For the limit to exist, the point must be approachable from both the left and the 
right. Note also that there is no limit at a totally isolated point on the graph. 

"Jump": If the graph suddenly jumps to a different level, there is no limit. For example, let fix ) 
be the greatest integer < x. Then, if c is an integer, when x approaches c from the right 
fix) = c, while when x approaches c from the left fix) = c- 1. Thus lim *_ c /(x) will not 
exist. 
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-2 - 1.5 -1 - 0.5 0 0.5 1 1.5 2 


Figure 17: A graph of 1 / (jc 2 ) on the interval [-2,2], 


Vertical asymptote: In 


the graph gets arbitrarily high as it approaches 0, so there is no limit. (In this case we 
sometimes say the limit is infinite; see the next section.) 
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0 0.05 0.1 0.15 0.2 0.25 0.3 

Figure 18: A graph of sin(l/x) on the interval (0,1/]. 


Infinite oscillation: These next two can be tricky to visualize. In this one, we mean that a graph 
continually rises above and falls below a horizontal line. In fact, it does this infinitely often 
as you approach a certain x- value. This often means that there is no limit, as the graph 
never approaches a particular value. However, if the height (and depth) of each oscilla- 
tion diminishes as the graph approaches the x-value, so that the oscillations get arbitrarily 
smaller, then there might actually be a limit. 

The use of oscillation naturally calls to mind the trigonometric functions. An exam- 
ple of a trigonometric function that does not have a limit as 

x 


approaches 0 is 


fix) = sin-. 

x 

As 

x 

gets closer to 0 the function keeps oscillating between 

-1 


and 1 . In fact, 


sin(l/x) 
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oscillates an infinite number of times on the interval between 0 and any positive 
value of 

x 

. The sine function is equal to zero whenever 


x = kn 


, where 


k 


is a positive integer. Between every two integers 


k 


sinx 


goes back and forth between 0 and 
or 0 and 1. Hence, 


-1 


for every 


sin(l/x) = 0 


x = ll{kn) 

. In between consecutive pairs of these values, 


and 


1 /(kn) 

ll[(k+l)TC] 


sin(l/x) 

goes back and forth from 0, to either 

-1 

or 1 and back to 0. We may also observe that there are an infinite number of such 
pairs, and they are all between 0 and 

l/n 

. There are a finite number of such pairs between any positive value of 


x 


and 


1/ 71 

, so there must be infinitely many between any positive value of 


x 
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and 0. From our reasoning we may conclude that, as 

x 


approaches 0 from the right, the function 

sin(l/x) 

does not approach any specific value. Thus, 

lim sin(l/x) 

x^O 


does not exist. 


7.5 Using limit notation to describe asymptotes 


Now consider the function 


1 

gW = —y • 
x z 

What is the limit as x approaches zero? The value of g(0) does not exist; it is not defined. 

sm = T 

Notice, also, that we can make g(x) as large as we like, by choosing a small x, as long as x ■£ 0. For 
example, to make g(x) equal to one trillion, we choose x to be 10 -6 . Thus, lim ,_o 1 lx 2 does not 
exist. 

Flowever, we do know something about what happens to g(x) when x gets close to 0 without 
reaching it. We want to say we can make g(x) arbitrarily large (as large as we like) by taking x to 
be sufficiently close to zero, but not equal to zero. We express this symbolically as follows: 

lim g(x) = lim -^=00 

x^O x^O X z 

Note that the limit does not exist at 0; for a limit, being 00 is a special kind of not existing. In 
general, we make the following definition. 
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Definition: Informal definition of a limit being +oo 

We say the limit of fix) as x approaches c is infinity if fix) becomes very big (as big as we like) 
when x is close (but not equal) to c. 

In this case we write 

lim/(x) = oo 
or 

f (x) * oo as x — * c 

Similarly, we say the limit of fix) as x approaches c is negative infinity if fix) becomes very 
negative when x is close (but not equal) to c. 

In this case we write 

lim/(x) = -oo 
or 

fix) — -oo as x — * c 


An example of the second half of the definition would be that lim x _ (l -1/x 2 = -oo. 


7.6 Key application of limits 


To see the power of the concept of the limit, let’s consider a moving car. Suppose we have a car 
whose position is linear with respect to time (that is, a graph plotting the position with respect 
to time will show a straight line). We want to find the velocity. This is easy to do from algebra; we 
just take the slope, and that’s our velocity. 

But unfortunately, things in the real world don’t always travel in nice straight lines. Cars speed 
up, slow down, and generally behave in ways that make it difficult to calculate their velocities. 

Now what we really want to do is to find the velocity at a given moment (the instantaneous ve- 
locity). The trouble is that in order to find the velocity we need two points, while at any given 
time, we only have one point. We can, of course, always find the average speed of the car, given 
two points in time, but we want to find the speed of the car at one precise moment. 

This is the basic trick of differential calculus, the first of the two main subjects of this book. We 
take the average speed at two moments in time, and then make those two moments in time 
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closer and closer together. We then see what the limit of the slope is as these two moments in 
time are closer and closer, and say that this limit is the slope at a single instant. 

We will study this process in much greater depth later in the book. First, however, we will need 
to study limits more carefully. 


7.7 External links 

• Online interactive exercises on limits 1 

• Tutorials for the calculus phobe 2 

nl:Analyse/Limieten 3 


1 http://wims.unice.fr/wims/wims.cgi ?module=home& se arc h_key word s = l imi t& search. 

CATE GO R Y=X 

2 http://www.calculus-help.com/funstuff/phobe.html 

3 http://nl.wikibooks.org/wiki/Analyse%2FLimieten 
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8 Finite Limits 


8.1 Informal Finite Limits 

Now, we will try to more carefully restate the ideas of the last chapter. We said then that the 
equation lim x ^2 fix) = 4 meant that, when x gets close to 2, /(x) gets close to 4. What exactly 
does this mean? How close is "close"? The first way we can approach the problem is to say that, 
at x = 1.99, fix) = 3.9601, which is pretty close to 4. 

Sometimes however, the function might do something completely different. For instance, sup- 
pose f{x) = x 4 - 2x 2 - 3.77, so y (1.99) = 3.99219201. Next, if you take a value even closer to 2, 
/(1.999) = 4.20602, in this case you actually move further from 4. As you can see here, the prob- 
lem with some functions is that, no matter how close we get, we can never be sure what they 
do. 

The solution is to find out what happens arbitrarily close to the point. In particular, we want to 
say that, no matter how close we want the function to get to 4, if we make x close enough to 2 
then it will get there. In this case, we will write 


lim fix) = 4 

x^2 

and say "The limit of fix), as x approaches 2, equals 4" or "As x approaches 2, fix) approaches 
4." In general: 

Definition: (New definition of a limit) 

We call L the limit of fix) as x approaches c if fix) becomes arbitrarily close to L whenever x 
is sufficiently close (and not equal) to c. 

When this holds we write 

lim fix) = L 
or 

f (x) — * L as x — *■ c. 
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8.2 One-Sided Limits 

Sometimes, it is necessary to consider what happens when we approach an x value from one 
particular direction. To account for this, we have one-sided limits. In a left-handed limit, x 
approaches a from the left-hand side. Likewise, in a right-handed limit, x approaches a from 
the right-hand side. 

For example, if we consider lim x ^2 Vx — 2, there is a problem because there is no way for x to 
approach 2 from the left hand side (the function is undefined here). But, if x approaches 2 only 
from the right-hand side, we want to say that Vx-2 approaches 0. 

Definition: (Informal definition of a one-sided limit) 

We call L the limit of /(x) as x approaches c from the right if /(x) becomes arbitrarily close to 
L whenever x is sufficiently close to and greater than c. 

When this holds we write 

lim fix) = L. 

X-* 'C + 

Similarly, we call L the limit of fix) as x approaches c from the left if fix) becomes arbitrarily 
close to L whenever x is sufficiently close to and less than c. 

When this holds we write 

lim fix) = L. 

x—>c 


In our example, the left-handed limit lim ,^2 Vx-2 does not exist. 

The right-handed limit, however, lim x _ 2 ' Vx-2 = 0. 

It is a fact that lim x _ f: /(x) exists if and only if lim x _ c i fix) and lim x _ c fix) exist and are equal 
to each other. In this case, lim x _ c /(x) will be equal to the same number. 

In our example, one limit does not even exist. Thus lim x ^2 Vx-2 does not exist either. 


68 




9 Infinite Limits 


9.1 Informal infinite limits 

Another kind of limit involves looking at what happens to fix) as x gets very big. For example, 
consider the function /(x) = 1/x. As x gets very big, 1/x gets very small. In fact, 1/x gets closer 
and closer to zero the bigger x gets. Without limits it is very difficult to talk about this fact, be- 
cause x can keep getting bigger and bigger and 1/x never actually gets to zero; but the language 
of limits exists precisely to let us talk about the behavior of a function as it approaches something 
- without caring about the fact that it will never get there. In this case, however, we have the same 
problem as before: how big does x have to be to be sure that /(x) is really going towards 0? 

In this case, we want to say that, however close we want /(x) to get to 0, for x big enough /(x) is 
guaranteed to get that close. So we have yet another definition. 

Definition: (Definition of a limit at infinity) 

We call L the limit of /(x) as x approaches infinity if /(x) becomes arbitrarily close to L when- 
ever x is sufficiently large. 

When this holds we write 

lim fix) = L 

x — ►oo 

or 

fix) — L as x — » oo. 

Similarly, we call L the limit of fix) as x approaches negative infinity if fix) becomes arbitrarily 
close to L whenever x is sufficiently negative. 

When this holds we write 

lim fix) = L 

x — *•— OO 

or 

fix) — L as x — -oo. 


So, in this case, we write: 
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lim - = 0 

x 

and say "The limit, as x approaches infinity, equals 0," or "as x approaches infinity, the function 
approaches 0". 

We can also write: 


lim — = 0, 
x 

because making x very negative also forces 1/x to be close to 0. 

Notice, however, that infinity is not a number 1 ; it’s just shorthand for saying "no matter how 
big." Thus, this is not the same as the regular limits we learned about in the last two chapters. 

9.2 Limits at infinity of rational functions 


One special case that comes up frequently is when we want to find the limit at oo (or -oo) of 
a rational function. A rational function is just one made by dividing two polynomials by each 
other. For example, /(x) = (x 3 + x - 6) / (x 2 - 4x + 3) is a rational function. Also, any polynomial 
is a rational function, since 1 is just a (very simple) polynomial, so we can write the function 
fix) ~ x 2 - 3 as fix) = (x 2 — 3) / 1, the quotient of two polynomials. 

There is a simple rule for determining a limit of a rational function as the variable approaches 
infinity. Look for the term with the highest exponent in the numerator. Look for the same in the 
denominator. This rule is based on that information. 

• If the exponent of the highest term in the numerator matches the exponent of the highest term 
in the denominator, the limit (at both oo and -oo) is the ratio of the coefficients of the highest 
terms. 

• If the numerator has the highest term, then the fraction is called "top-heavy" and neither limit 
(at oo or at -oo) exists. 

• If the denominator has the highest term, then the fraction is called "bottom-heavy" and the 
limit (at both oo and -oo) is zero. 

Note that, if the numerator or denominator is a constant (including 1, as above), then this is the 
same as x°. Also, a straight power of x, like x 3 , has coefficient 1, since it is the same as lx 3 . 


9.2.1 Example 

Find lim^fEf. 

The function fix) = (x - 5) / (x - 3) is the quotient of two polynomials, x - 5 and x - 3. By our 
rule we look for the term with highest exponent in the numerator; it’s x. The term with highest 


1 http://en.wikibooks.org/wiki/%2FInfinity%20is%20not%20a%20number 
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exponent in the denominator is also x. So, the limit is the ratio of their coefficients. Since x = lx, 
both coefficients are 1, so liin r _ 00 (x-5)/(x-3) = 1/1 = 1. 
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10 Continuity 


10.1 Defining Continuity 

We are now ready to define the concept of a function being continuous. The idea is that we want 
to say that a function is continuous if you can draw its graph without taking your pencil off the 
page. But sometimes this will be true for some parts of a graph but not for others. Therefore, 
we want to start by defining what it means for a function to be continuous at one point. The 
definition is simple, now that we have the concept of limits: 

Definition: (continuity at a point) 

If fix) is defined on an open interval containing c, then fix) is said to be continuous at c if and 
only if lim x _ c / (x) = /(c). 


Note that for / to be continuous at c, the definition in effect requires three conditions: 

1. that / is defined at c, so /(c) exists, 

2. the limit as x approaches c exists, and 

3. the limit and /(c) are equal. 

If any of these do not hold then / is not continuous at c. 

The idea of the definition is that the point of the graph corresponding to c will be close to the 
points of the graph corresponding to nearby x-values. Now we can define what it means for a 
function to be continuous in general, not just at one point. 

Definition: (continuity) 

A function is said to be continuous on ia, b) if it is continuous at every point of the interval ia,b). 


We often use the phrase "the function is continuous" to mean that the function is continu- 
ous at every real number. This would be the same as saying the function was continuous on 
(&minus;oo, oo), but it is a bit more convenient to simply say "continuous". 

Note that, by what we already know, the limit of a rational, exponential, trigonometric or loga- 
rithmic function at a point is just its value at that point, so long as it’s defined there. So, all such 
functions are continuous wherever they’re defined. (Of course, they can’t be continuous where 
they’re not defined!) 
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10.2 Discontinuities 

A discontinuity is a point where a function is not continuous. There are lots of possible ways this 
could happen, of course. Here we’ll just discuss two simple ways. 


10.2.1 Removable Discontinuities 

The function /(x) = is not continuous at x = 3. It is discontinuous at that point because the 
fraction then becomes g, which is undefined. Therefore the function fails the first of our three 
conditions for continuity at the point 3; 3 is just not in its domain. 

However, we say that this discontinuity is removable. This is because, if we modify the function 
at that point, we can eliminate the discontinuity and make the function continuous. To see how 
to make the function /(x) continuous, we have to simplify /(x), getting /(x) = = tX +x- 3 )~ 3> = 

• We can define a new function g(x) where g(x) = x+3. Note that the function g(x) is not 
the same as the original function /(x), because g(x) is defined at x = 3, while /(x) is not. Thus, 
g(x) is continuous at x = 3, since lim x ^ 3 (x + 3) = 6 = g(3). However, whenever x ■£ 3, /(x) = g(x); 
all we did to / to get g was to make it defined at x = 3. 

In fact, this kind of simplification is often possible with a discontinuity in a rational function. 
We can divide the numerator and the denominator by a common factor (in our example x - 3) 
to get a function which is the same except where that common factor was 0 (in our example at 
x = 3) . This new function will be identical to the old except for being defined at new points where 
previously we had division by 0. 

Unfortunately this is not possible in every case. For example, the function /(x) = t2 *gy +9 has 
a common factor of x - 3 in both the numerator and denominator, but when you simplify you 
are left with g(x) = Which is still not defined at x = 3. In this case the domain of /(x) and 
g(x) are the same, and they are equal everywhere they are defined, so they are in fact the same 
function. The reason that g(x) differed from /(x) in the first example was because we could take 
it to have a larger domain and not simply that the formulas defining /(x) and g(x) were different. 
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10.2.2 Jump Discontinuities 



Unfortunately, not all discontinuities can be removed from a function. Consider this function: 


k{x) = 



if x > 0 
if x < 0 


Since lirn x _o k(x) does not exist, there is no way to redefine k at one point so that it will be 
continuous at 0. These sorts of discontinuities are called nonremovable discontinuities. 

Note, however, that both one-sided limits exist; lim x ^o- fc(x) = -1 and lim x ^o+ fc(x) = 1. The 
problem is that they are not equal, so the graph "jumps" from one side of 0 to the other. In such 
a case, we say the function has a jump discontinuity. (Note that a jump discontinuity is a kind of 
nonremovable discontinuity.) 


10.3 One-Sided Continuity 

Just as a function can have a one-sided limit, a function can be continuous from a particular 
side. For a function to be continuous at a point from a given side, we need the following three 
conditions: 
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1 . the function is defined at the point, 

2. the function has a limit from that side at that point and 

3. the one-sided limit equals the value of the function at the point. 

A function will be continuous at a point if and only if it is continuous from both sides at that 
point. Now we can define what it means for a function to be continuous on a closed interval. 

Definition: (continuity on a closed interval) 

A function is said to be continuous on [a, b] if and only if 

1. it is continuous on ( a , b), 

2. it is continuous from the right at a and 

3. it is continuous from the left at b. 


Notice that, if a function is continuous, then it is continuous on every closed interval contained 
in its domain. 


1 0.4 Intermediate Value Theorem 

The definition of continuity we’ve given might not seem to have much to do with the intuitive 
notion we started with of being able to draw the graph without lifting one’s pencil. Fortunately, 
there is a connection, given by the so-called intermediate value theorem, which says, informally, 
that if a function is continuous then its graph can be drawn without ever picking up one’s pencil. 
More precisely: 

Intermediate Value Theorem 

If a function / is continuous on a closed interval [a, b], then for every value k between /(a) and 
f{b) there is a value c between a and b such that /(c) = lc. 


76 





Intermediate Value Theorem 


10.4.1 Application: bisection method 


F(x) 



Figure 20: A few steps of the bisection method applied over the starting range [ai ;bi] . The 
bigger red dot is the root of the function. 


The bisection method is the simplest and most reliable algorithm to find zeros of a continuous 
function. 

Suppose we want to solve the equation f{x) = 0. Given two points a and b such that f(a) and 
f{b) have opposite signs, the intermediate value theorem tells us that / must have at least one 
root between a and b as long as / is continuous on the interval [a, b ] . If we know / is continuous 
in general (say, because it’s made out of rational, trigonometric, exponential and logarithmic 
functions), then this will work so long as / is defined at all points between a and b. So, let’s 
divide the interval [a, b] in two by computing c = {a + b)l 2. There are now three possibilities: 

1. /(c) = 0, 

2. /(a) and /(c) have opposite signs, or 

3. /(c) and f(b ) have opposite signs. 

In the first case, we’re done. In the second and third cases, we can repeat the process on the sub- 
interval where the sign change occurs. In this way we home in to a small sub-interval containing 
the zero. The midpoint of that small sub -interval is usually taken as a good approximation to the 
zero. 
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Note that, unlike the methods you may have learned in algebra, this works for any continuous 
function that you (or your calculator) know how to compute. 
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1 1 Formal Definition of the Limit 


In preliminary calculus, the concept of a limit is probably the most difficult one to grasp (if noth- 
ing else, it took some of the most brilliant mathematicians 150 years to arrive at it); it is also the 
most important and most useful. 

The intuitive definition of a limit is inadequate to prove anything rigorously about it. The prob- 
lem lies in the vague term "arbitrarily close". We discussed earlier that the meaning of this term 
is that the closer x gets to the specified value, the closer the function must get to the limit, so that 
however close we want the function to the limit, we can accomplish this by making x sufficiently 
close to our value. We can express this requirement technically as follows: 

Definition: (Formal definition of a limit) 

Let fix) be a function defined on an open interval D that contains c, except possibly at x = c. Let 
L be a number. Then we say that 

lim fix) = L 

if, for every e > 0, there exists a 8 > 0 such that for all x e D with 

0 < |x- c| < 8, 

we have 

|/(X)-L| <£ 


To further explain, earlier we said that "however close we want the function to the limit, we can 
find a corresponding x close to our value." Using our new notation of epsilon (e) and delta (5), 
we mean that if we want to make fix) within e of L, the limit, then we know that making x within 
8 of c puts it there. 

Again, since this is tricky, let’s resume our example from before: fix) = x 2 , at x = 2. To start, let’s 
say we want fix) to be within .01 of the limit. We know by now that the limit should be 4, so we 
say: for e = .01, there is some 8 so that as long as 0 < |x- c| < <5, then |/(x) -L\<e. 

To show this, we can pick any 8 that is bigger than 0, so long as it works. For example, you might 
pick .00000000000001, because you are absolutely sure that if x is within .00000000000001 of 2, 
then fix) will be within .01 of 4. This 8 works for e = .01. But we can’t just pick a specific value 
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for e , like .01, because we said in our definition "for every e > 0." This means that we need to be 
able to show an infinite number of 5s, one for each e. We can’t list an infinite number of 5s! 


Of course, we know of a very good way to do this; we simply create a function, so that for every e, 
it can give us a 5. In this case, one definition of 5 that works is 5(c) = j ^ 2 ’„ C ~ \ (see 


[\/(e + 4) -2, ife<4 
example 5 in choosing delta 1 for an explanation of how this delta was chosen) 


So, in general, how do you show that /(x) tends to L as x tends to c? Well imagine somebody gave 
you a small number e (e.g., say c = 0.03). Then you have to find a 5 > 0 and show that whenever 
0 < |x- c| < 5 we have |/(x) - L| < 0.03. Now if that person gave you a smaller c (say e = 0.002) 
then you would have to find another 5, but this time with 0.03 replaced by 0.002. If you can do 
this for any choice of e then you have shown that /(x) tends to L as x tends to c. Of course, the 
way you would do this in general would be to create a function giving you a 5 for every c, just as 
in the example above. 


1 Chapter 61.0.1 on page 403 
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11.1 Formal Definition of the Limit at Infinity 


Definition: (Limit of a function at infinity) 

We call L the limit of fix) as x approaches oo if for every number e > 0 there exists a 8 such that 
whenever x > 5 we have 

|/(X)-L| < E 

When this holds we write 

lim fix) = L 

x — *oo 

or 

fix) - L 
as 

X — > • oo. 

Similarly, we call L the limit of fix ) as x approaches -oo if for every number e > 0, there exists a 
number 8 such that whenever x < 8 we have 

|/(X)-L| < £ 

When this holds we write 

lim fix) = L 

x — *— OO 

or 

fix) - L 


Notice the difference in these two definitions. For the limit of fix) as x approaches oo we are 
interested in those x such that x > 8. For the limit of fix) as x approaches -oo we are interested 
in those x such that x < 8. 


11.2 Examples 


Flere are some examples of the formal definition. 
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Example 1 

We know from earlier in the chapter that 


lim - = 2 

x— »8 4 


What is 8 when e - 0.01 for this limit? 

We start with the desired conclusion and substitute the given values for /(x) and e: 


x 

4 


0.01 


Then we solve the inequality for x: 


7.96 < x < 8.04 


This is the same as saying 


-0.04 <x- 8 <0.04 


(We want the thing in the middle of the inequality to be x - 8 because that’s where we’re taking 
the limit.) We normally choose the smaller of |-0.04| and 0.04 for 8, so 8 = 0.04, but any smaller 
number will also work. 

Example 2 

What is the limit of /(x) = x + 7 as x approaches 4? 

There are two steps to answering such a question; first we must determine the answer — this is 
where intuition and guessing is useful, as well as the informal definition of a limit — and then 
we must prove that the answer is right. 

In this case, 1 1 is the limit because we know /(x) = x + 7 is a continuous function whose domain 
is all real numbers. Thus, we can find the limit by just substituting 4 in for x, so the answer is 
4 + 7 = 11. 

We’re not done, though, because we never proved any of the limit laws rigorously; we just stated 
them. In fact, we couldn’t have proved them, because we didn’t have the formal definition of 
the limit yet, Therefore, in order to be sure that 1 1 is the right answer, we need to prove that no 
matter what value of e is given to us, we can find a value of 8 such that 


|/(X)-Il| <£ 


whenever 
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|x-4| < 8. 

For this particular problem, letting <5 = a works (see choosing delta 2 for help in determining 
the value of 8 to use in other problems) . Now, we have to prove 

|/(x) — ll| < £ 

given that 


x-4| < 8 = a 


Since |x - 4| < a, we know 


|/(x) - ll| = |x + 7-ll| = |x-4|<£ 

which is what we wished to prove. 

Example 3 

What is the limit of /(x) = x 2 as x approaches 4? 

As before, we use what we learned earlier in this chapter to guess that the limit is 4 2 = 16. Also as 
before, we pull out of thin air that 


8=Ve+ 16-4 


Note that, since a is always positive, so is 8, as required. Now, we have to prove 

|x 2 — 16) < a 

given that 


x-4| < 8 = Ve+ 16-4 


We know that 


|x + 4| = |(x-4)+8|<|x-4|+8<5 + 8 


2 Chapter 61.0. 1 on page 403 
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(because of the triangle inequality), so 


x 2 -16| = |x — 4| • |x + 4| 

< 5-(5 + 8) 

< (v / 16 + £-4)-( v / 16 + £' + 4) 

< (\/T6+T) 2 — 4 2 
= £+ 16-16 


< 


£. 


Example 4 

Show that the limit of sin(l/x) as x approaches 0 does not exist. 

We will proceed by contradiction. Suppose the limit exists; call it L. For simplicity, we’ll assume 
that L ^ 1; the case for L = 1 is similar. Choose e = 1 1 - L \ . Then if the limit were L there would be 
some 8 > 0 such that |sin(l/x) - L\ < e = |1 - L\ for every x with 0 < |x| < 8. But, for every 8 > 0, 
there exists some (possiblyvery large) n such that 0 < xo = ^. /2 + 2 ^„ <S, but | sin(l/xo)-L| = |1-L|, 
a contradiction. 

Example 5 

What is the limit of xsin(l/x) as x approaches 0? 

By the Squeeze Theorem, we know the answer should be 0. To prove this, we let 8 = e. Then for 
all x, if 0 < |x| < 8, then |xsin(l/x) - 0| < |x| < £ as required. 

Example 6 

Suppose that lim x ^ fl fix) = L and lim_ vw/ g(x) = M. What is lim x _ a (/(x) + g(x))? 

Of course, we know the answer should be L + M, but now we can prove this rigorously. Given 
some £, we know there’s a <5i such that, for any x with 0 < |x- a\ < 8\, |/(x) - L\ < e!2 (since the 
definition of limit says "for any e" , so it must be true for el 2 as well). Similarly, there’s a 5 2 such 
that, for any x with 0 < |x- a\ < d 2 , |g(x) -M\< e/2. We can set 8 to be the lesser of <5i and <5 2 . 
Then, for any x with 0 < \x- a\ < 8, |(/(x) + g(x)) - (L + M) \ < |/(x) - L\ + |g(x) - M\ < e/2+e/2 = 
£, as required. 

If you like, you can prove the other limit rules too using the new definition. Mathematicians 
have already done this, which is how we know the rules work. Therefore, when computing a 
limit from now on, we can go back to just using the rules and still be confident that our limit is 
correct according to the rigorous definition. 
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1 1 .3 Formal Definition of a Limit Being Infinity 


Definition: (Formal definition of a limit being infinity) 

Let f{x ) be a function defined on an open interval D that contains c, except possibly at x = c. 
Then we say that 


lim f{x) — oo 

if, for every e, there exists a 8 > 0 such that for all x e D with 


0 < |x- c\ < 8, 


we have 


fix) > e 


When this holds we write 


lim/(x) =oo 
or 


as 


fix ) — oo 
x — • c 


Similarly, we say that 

lim/(x) = -oo 

if, for every e, there exists a 8 > 0 such that for all x e D with 


we have 


0 < \x- c\ < 8, 


fix) < E 


When this holds we write 


lim fix) = -oo 

86 x^c J 

or 


fix) — -oo 


as 



Exercises 


11.4 Exercises 

11.5 Limits with Graphs 


Given the following graph, evaluate the succeeding limits 


1 1 .6 Basic Limit Exercises 


1. lim x ^2(4x 2 -3x + 1) 

2. lim x - 5 (x 2 ) 

Solutions 3 


11.7 One Sided Limits 

Evaluate the following limits or state that the limit does not exist. 

.. r 3 4-r 2 

L hm^o- 

2. lim x — 7 - |x 2 + x| -x 

3. lim x — _i- \J\ - x 2 

4. lim x — _i+ Vl - x 2 


11.8 Two Sided Limits 


Evaluate the following limits or state that the limit does not exist. 


1 . 

Iim,_ 

t 

“! x-1 

2. 

lim*— 

1 

4 x-4 

3. 

Unix- 

1 

2 x-2 

4. 

Unix- 

x 2 -9 
-3 x+3 

5. 

Unix- 

x 2 -9 
3 x-3 

6. 

limx- 

x 2 +2x+ 1 
_1 x+1 

7. 

limx- 

x 3 + l 
— 1 x+1 

8. 

linix- 

x 2 +5x- 36 
4 x 2 -16 

9. 

lim x — 

x-25 
25 v/x-5 

10. 

limx- 

|x| 

o — 

11. 

linix- 

1 

2 (x-2) 2 

12. 

limx- 

Vx 2 + 16 
3 x-3 

13. 

linix- 

3x 2 -8x-: 
~ 2 2x 2 -18 

14. 

limx— 

x 2 +2x+ 1 
2 x 2 -2x+l 


3 Chapter 62.2 on page 408 
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15. lim x _ 3 

16 - F+x 

17. lim^i ^ 

18. lim A ^ix 3 + 5x- 2 T 7 

19 ' lim ^l .r' V 2.v' 3 

2°. lim^r 
Solutions 4 


1 1.9 Limits to Infinity 


Evaluate the following limits or state that the limit does not exist. 


1 . lim x _ 

2 . Iim x _ 

3. lim x ^ 

4. lim x _ 

5. Iim x _ 

6 . lim x _ 

7. lim x _ 


-x+n 

00 x 2 +3x+2 
x z +2x+l 
3x 2 + l 
3x z +x 
_ °° 2x 2 -15 
2x z -32 
00 x 3 -64 
3x z +4x 
00 x 4 +2 

3x~+2x+3 
-°° 2x 2 +3 

x 2 +2 
oo X 3_2 


11.10 Limits of Piece Functions 


Evaluate the following limits or state that the limit does not exist. 
1 . Consider the function 


a) lim x ^ 2 -fix) 

b) lim x ^ 2 + fix) 

c) lim x _ 2 fix) 

2. Consider the function 


fix) = 


(x- 2) 2 

x-3 


if x < 2 
if x > 2 . 


a) lim x _ 4 + f{x) 

b) lim x ^ 4 -fix) 


fix) = < 


— 2x + 1 
x + 1 
x 2 + 2 


if x < 0 
if 0 < x < 4 
if x > 4. 


4 http : / /en . wikibooks . org/wiki/Calculus%2FLimits%2FSolutions%23Harder%20Limit% 
20EXERCISES 
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c) lim x ^ 0 + fix) 

d) lim x ^ 0 /M 

e) lim x ^ 0 fix) 

f) lim X ^ifix) 

3. Consider the function 


a) lim x ^ 0 /(x) 

b) lim x ^ 2 -/W 

c) lim x ^ 2 + fix) 

d) lim X ^ 2 fix) 


fix) = { 


2x — 3 
8 

-x + 3 


if x < 2 
if x = 2 
if x > 2. 


11.11 Limits using L’HopitaTs rule 


i 1 ; x+tanx 

L lim ^o^Ecr 

2 - 

3- l^oip 

4. lim^ooji 

5. 


Solutions 5 


5 Chapter 62.5 on page 409 
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Formal Definition of the Limit 
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12 Differentiation 


12.1 What is differentiation? 

Differentiation is an operation that allows us to find a function that outputs the rate of change 
of one variable with respect to another variable. 

Informally, we may suppose that we’re tracking the position of a car on a two-lane road with 
no passing lanes. Assuming the car never pulls off the road, we can abstractly study the car’s 
position by assigning it a variable, x. Since the car’s position changes as the time changes, we 
say that x is dependent on time, or x = x(t). This tells where the car is at each specific time. 
Differentiation gives us a function dx/dt which represents the car’s speed, that is the rate of 
change of its position with respect to time. 

Equivalently, differentiation gives us the slope at any point of the graph of a non-linear function. 
For a linear function, of form fix) = ax + b, a is the slope. For non-linear functions, such as 
f[x ) = 3x 2 , the slope can depend on x; differentiation gives us a function which represents this 
slope. 


12.2 The Definition of Slope 

Historically, the primary motivation for the study of differentiation was the tangent line prob- 
lem: for a given curve, find the slope of the straight line that is tangent to the curve at a given 
point. The word tangent comes from the Latin word tangens, which means touching. Thus, to 
solve the tangent line problem, we need to find the slope of a line that is "touching" a given curve 
at a given point, or, in modern language, that has the same slope. But what exactly do we mean 
by "slope" for a curve? 

The solution is obvious in some cases: for example, a line y = mx + c is its own tangent; the slope 
at any point is m. For the parabola y = x 2 , the slope at the point (0,0) is 0; the tangent line is 
horizontal. 

But how can you find the slope of, say, y = sinx + x 2 at x = 1.5? This is in general a nontrivial 
question, but first we will deal carefully with the slope of lines. 
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Differentiation 


12.2.1 Of a Line 



Figure 21: Three lines with different slopes 


The slope of a line, also called the gradient of the line, is a measure of its inclination. A line that 
is horizontal has slope 0, a line from the bottom left to the top right has a positive slope and a 
line from the top left to the bottom right has a negative slope. 

The slope can be defined in two (equivalent) ways. The first way is to express it as how much the 
line climbs for a given motion horizontally. We denote a change in a quantity using the symbol A 
(pronounced "delta"). Thus, a change in x is written as Ax. We can therefore write this definition 
of slope as: 


Slope = 


Ay 

Ax 


An example may make this definition clearer. If we have two points on a line, P(xi,yi) and 
Q (x 2 ,y 2 ), the change in x from P to Q is given by: 


Ax = x 2 - Xi 
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The Definition of Slope 


Likewise, the change in y from P to Q is given by: 

Ay = y 2 - yi 

This leads to the very important result below. 

The slope of the line between the points (xi, yi) and (X2, y 2 ) is 

Ay _ y 2 -yi 
Ax x 2 - Xi 


Alternatively, we can define slope trigonometrically, using the tangent function: 


Slope = tan (a) , 

where a is the angle from the line to the rightward-pointing horizontal (measured clockwise). 
If you recall that the tangent of an angle in a right triangle is defined as the length of the side 
opposite the angle over the length of the leg adjacent to the angle, you should be able to spot the 
equivalence here. 


12.2.2 Of a Graph of a Function 


The graphs of most functions we are interested in are not straight lines (although they can be) 
but rather curves. We cannot define the slope of a curve in the same way as we can for a line. In 
order for us to understand how to find the slope of a curve at a point, we will first have to cover 
the idea of tangency. Intuitively, a tangent is a line which just touches a curve at a point, such 
that the angle between them at that point is zero. Consider the following four curves and lines: 

(i) (ii) 
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Differentiation 




<ol style="list-style-type:lower-roman"> 


The line L crosses, but is not tangent to C at P. 

The line L crosses, and is tangent to C at P. 

The line L crosses C at two points, but is tangent to C only at P. 
There are many lines that cross C at P, but none are tangent. In fact, 
at P. 


this curve has no tangent 


</ol> 

A secant is a line drawn through two points on a curve. We can construct a definition of a tangent 
as the limit of a secant of the curve taken as the separation between the points tends to zero. 
Consider the diagram below. 
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The Definition of Slope 



As the distance h tends to zero, the secant line becomes the tangent at the point x (1 . The two 
points we draw our line through are: 


P(xo,/(x 0 )) 


and 


Q (x 0 + h,f (x 0 + h)) 

As a secant line is simply a line and we know two points on it, we can find its slope, m/,, using the 
formula from before: 


T2-T1 

X 2 -Xi 
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Differentiation 


(We will refer to the slope as m/, because it may, and generally will, depend on h.) Substituting 
in the points on the line, 

/ (x 0 + h)—f (x 0 ) 

mh = . 

(x 0 + h)-x o 

This simplifies to 

/ (x 0 + h)—f (x 0 ) 

m h = . 

h 

This expression is called the difference quotient. Note that h can be positive or negative — it is 
perfectly valid to take a secant through any two points on the curve — but cannot be 0. 

The definition of the tangent line we gave was not rigorous, since we’ve only defined limits of 
numbers — or, more precisely, of functions that output numbers — not of lines. But we can 
define the slope of the tangent line at a point rigorously, by taking the limit of the slopes of the 
secant lines from the last paragraph. Having done so, we can then define the tangent line as well. 
Note that we cannot simply set h to zero as this would imply division of zero by zero which would 
yield an undefined result. Instead we must find the limit 1 of the above expression as h tends to 
zero: 

Definition: (Slope of the graph of a function) 

The slope of the graph of fix) at the point (xo, /(xo)) is 

f (x 0 h) — f (x 0 ) 

lim 

h — >0 h 

If this limit does not exist, then we say the slope is undefined. 

If the slope is defined, say m, then the tangent line to the graph of /(x) at the point (xo, /(xo)) is 
the line with equation 

y~ fix q) = m-(x-xo) 


This last equation is just the point-slope form for the line through (xo, /(xo)) with slope m. 

12.2.3 Exercises 

1. Find the slope of the tangent to the curve y - x 2 at (1, 1). 

Answer: The definition of the slope of / at xo is lim/,^ 0 ^ (Xo) 

Substituting in fix) - x 2 and xo = 1 gives: 

1 Chapter 1 . 1 on page 1 
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The Rate of Change of a Function at a Point 


(l + fi) 2 -l 

II 

43 

h 2 + 2 h 

h 

h 


II 

43 

hih + 2) 


h 


= lim h + 2 
h—> 0 


= 2 


12.3 The Rate of Change of a Function at a Point 

Consider the formula for average velocity in the x-direction, where Ax is the change in x over 
the time interval A t. This formula gives the average velocity over a period of time, but suppose 
we want to define the instantaneous velocity. To this end we look at the change in position as the 
change in time approaches 0. Mathematically this is written as: lim^ t _o ff . which we abbrevi- 
ate by the symbol • (The idea of this notation is that the letter d denotes change.) Compare the 
symbol d with A. The (entirely non-rigorous) idea is that both indicate a difference between two 
numbers, but A denotes a finite difference while d denotes an infinitesimal difference. Please 
note that the symbols dx and dl have no rigorous meaning on their own, since lim^ r _o A/ = 0, 
and we can’t divide by 0. 

(Note that the letter 5 is often used to denote distance, which would yield • The letter d is often 
avoided in denoting distance due to the potential confusion resulting from the expression f jfj .) 


12.4 The Definition of the Derivative 

You may have noticed that the two operations we’ve discussed — computing the slope of the tan- 
gent to the graph of a function and computing the instantaneous rate of change of the function 
— involved exactly the same limit. That is, the slope of the tangent to the graph of y = /(x) is ^ . 
Of course, ^ can, and generally will, depend on x, so we should really think of it as a function 
of x. We call this process (of computing ^) differentiation. Differentiation results in another 
function whose value for any value x is the slope of the original function at x. This function is 
known as the derivative of the original function. 

Since lots of different sorts of people use derivatives, there are lots of different mathematical 
notations for them. Here are some: 

• fix) (read "f prime of x") for the derivative of fix), 

• D x [f(x)], 

• D/(x), 

• ^ for the derivative of y as a function of x or 

• 4 - [y] , which is more useful in some cases. 

Most of the time the brackets are not needed, but are useful for clarity if we are dealing with 
something like D[fg), where we want to differentiate the product of two functions, / and g. 

The first notation has the advantage that it makes clear that the derivative is a function. That is, 
if we want to talk about the derivative of fix) at x = 2, we can just write /'( 2). 
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Differentiation 


In any event, here is the formal definition: 

Definition: (derivative) 

Let fix) be a function. Then fix) = lim Ax _ () f^ x+/Xx ^~f M wherever this limit exists. In this case 
we say that / is d iff erentiable at x and its derivative at x is f (x) . 


12.4.1 Examples 
Example 1 

The derivative of fix) - x/2 is 

' x+Ax: _ x ' 

fix) = lim — — = lim 

Ax^O Ax Ax^O 

) 

no matter what x is. This is consistent with the definition of the derivative as the slope of a 
function. 

Example 2 

What is the slope of the graph of y = 3x 2 at (4,48)? We can do it "the hard (and imprecise) way", 
without using differentiation, as follows, using a calculator and using small differences below 
and above the given point: 

When x = 3.999, y = 47.976003. 

When x = 4.001, y = 48.024003. 

Then the difference between the two values of x is Ax = 0.002. 

Then the difference between the two values of y is Ay = 0.048. 

Thus, the slope = ^ = 24 at the point of the graph at which x = 4. 

But, to solve the problem precisely, we compute 



Ihm^o 3 ^ 2 ^ 8 I = 3lim M ^f^ Ml = 31im A x^ 0 16+8A ^Ax)*-i6 , , = 

3 limAx-o 8Aa ^ Aa) |- | | = 3lim Ax ^o(8 + Ax) |- | | = 3(8) |- | | = 24. |} We were lucky this time; 
the approximation we got above turned out to be exactly right. But this won’t always be so, and, 
anyway, this way we didn’t need a calculator. 

In general, the derivative of fix) = 3x 2 is 


| fix) | = lim Ax ^ 0 3(x+A A f- 3x2 |-|| = 3lim A ^ 0 
3lim Ax ^ 0 2xAx +j Ax) ~ |- 1 1 = 31im Ax ^ 0 (2x + Ax) |- 


(x+Ax) 2 -x 2 

Ax 


= 3(2x) 


Example 3 


i oi' x 2 +2xAx+(Ax) 2 -x 2 i i 

| = 31im Ax ^ 0 xic r I 

= 6x. |} 
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The Definition of the Derivative 


f -1, x<0 

ff fix ) = |x| (the absoiute value function) then fix ) = < undefined, x = 0 . 

( 1, x>0 

Here, fix) is not smooth (though it is continuous) at x = 0 and so the limits lim A _ () fix) and 
lim x _ 0 f (x) (the limits as 0 is approached from the right and left respectively) are not equal. 
From the definition, /'( 0) = limAx^o which does not exist. Thus, /'(0)is undefined, and so 
fix) has a discontinuity at 0. This sort of point of non-differentiability is called a cusp. Func- 
tions may also not be differentiable because they go to infinity at a point, or oscillate infinitely 
frequently. 


12.4.2 Understanding the Derivative Notation 

Notation for differentiation 2 The derivative notation is special and unique in mathematics. 
The most common notation for derivatives you’ll run into when first starting out with differenti- 
ating is the Leibniz notation, expressed as You may think of this as "rate of change in y with 

respect to x". You may also think of it as "infinitesimal value of y divided by infinitesimal value 
of x". Either way is a good way of thinking, although you should remember that the precise defi- 
nition is the one we gave above. Often, in an equation, you will see just ^ , which literally means 
" derivative with respect to x" . This means we should take the derivative of whatever is written to 
the right; that is, ^ (x + 2) means ^ where y = x + 2. 

As you advance through your studies, you will see that we sometimes pretend that dy and dx are 
separate entities that can be multiplied and divided, by writing things like dy = x 4 dx. Eventually 
you will see derivatives such as which just means that the input variable of our function is 

called y and our output variable is called x; sometimes, we will write ^ , to mean the derivative 
with respect to y of whatever is written on the right. In general, the variables could be anything, 

Qoy dQ 

sa Y JT- 

All of the following are equivalent for expressing the derivative of y = x 2 

. ^y = 2x 

dx 

• S * 2 = 2 * 

• dy = 2xdx 

• fix) = 2x 

• Difix)) = 2x 


12.4.3 Exercises 

1. Using the definition of the derivative find the derivative of the function fix) = 2x + 3. 

2. Using the definition of the derivative find the derivative of the function fix) = x 3 . Now try 
fix) = x 4 . Can you see a pattern? fn the next section we will find the derivative of fix) = x n for 
all n. 

3. The text states that the derivative of |x| is not defined at x = 0. Use the definition of the 
derivative to show this. 


2 http://en.wikipedia.org/wiki/Notation%20for%20differentiation 
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Differentiation 


4. Graph the derivative to y = 4x 2 on a piece of graph paper without solving for dyldx. Then, 
solve for dyldx and graph that; compare the two graphs. 

5. Use the definition of the derivative to show that the derivative of sinx is cosx. Hint: Use a 
suitable sum to product formula and the fact that lini/_o = 1. 

12.5 Differentiation rules 

The process of differentiation is tedious for complicated functions. Therefore, rules for differ- 
entiating general functions have been developed, and can be proved with a little effort. Once 
sufficient rules have been proved, it will be fairly easy to differentiate a wide variety of functions. 
Some of the simplest rules involve the derivative of linear functions. 


12.5.1 Derivative of a Constant Function 

For any fixed real number c, 


d_ 

dx 


[c] = 0. 


Intuition 

The graph of the function /(x) = c is a horizontal line, which has a constant slope of zero. There- 
fore, it should be expected that the derivative of this function is zero, regardless of the values of 
x and c. 


Proof 

The definition of a derivative is 


/(x + Ax)-/(x) 

Inn . 

Ax— >0 Ax 

Let f{x) - c for all x. (That is, / is a constant function.) Then fix + Ax) = c. Therefore 

d , c- c 
— [c] = Inn — — = 0. 

dx Ax— »o Ax 


Examples 


1. £[31=0 

2. £[z] = 0 

Note that, in the second example, z is just a constant. 
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Differentiation rules 


12.5.2 Derivative of a Linear Function 

For any fixed real numbers m and c, 

[mx + c] = m 

The special case ^| = 1 shows the advantage of the ^ notation -- rules are intuitive by basic 
algebra, though this does not constitute a proof, and can lead to misconceptions to what exactly 
dx and dy actually are. 


Intuition 

The graph of y = mx + c is a line with constant slope m. 


Proof 

If f{x) = mx + c, then fix + Ax) = mix + Ax) + c. So, 


| fix) | = lim A ^ 0 | = lim A ^ 0 ^x + Ax)-mx 

lim Ax _ 0 I- 1 1 = m - If 


lim Ax _ 0 


mx+mAx-mx 

Ax 


12.5.3 Constant multiple and addition rules 


Since we already know the rules for some very basic functions, we would like to be able to take 
the derivative of more complex functions by breaking them up into simpler functions. Two tools 
that let us do this are the constant multiple rule and the addition rule. 


The Constant Rule 

For any fixed real number c, [c/(x)] = [fix)] 

The reason, of course, is that one can factor c out of the numerator, and then of the entire limit, 
in the definition. 

Example 

We already know that 


d_ 

dx 


[x 2 ] =2x 


Suppose we want to find the derivative of 3x 

{| 


2 
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Differentiation 


HiU 3 * 2 ] I = 3 iU x2 ] |-|| = 3x2x |-|| = 6x |-|} 

Another simple rule for breaking up functions is the addition rule. 

The Addition and Subtraction Rules 

^[/(x)±g(x)] = ^[/(x)]±^[g(x)] 

Proof 

From the definition: 

i™ [/(x+Ax)±g(x+Ax)]-[/(x)±g(x)] 

l lm Ax— >0 Ax 

_ i:„ [/(x+Ax)-/(x)] + [g(x+Ax)-g(x)] 

- HITIAx— 0 Ax 

_ 1 ;™ [/(x+Ax)-/(x)] , [g(x+Ax)-g(x)] ' 

- llm Ax-0 Ai ± llm Ax-0 Ax 

By definition then, this last term is [/(x)] ± ^ [g(x)] 

Example 
What is: 

{| 

I- \£ ( 3x2 + 5x ] I = -fx ( 3x2 + 5x l I' 1 1 = £ I 3x2 ] + fx t 5x l I' 1 1 = 6x+ A t 5x l I' 1 1 = 6 x + 5 I- |i 

The fact that both of these rules work is extremely significant mathematically because it means 
that differentiation is linear. You can take an equation, break it up into terms, figure out the 
derivative individually and build the answer back up, and nothing odd will happen. 

We now need only one more piece of information before we can take the derivatives of any poly- 
nomial. 

12.5.4 The Power Rule 

[x n ] = nx n ~ l For example, in the case of x 2 the derivative is 2X 1 = 2x as was established earlier. 
A special case of this rule isthatdx/dx= dx l /dx = lx° = 1. 

Since polynomials are sums of monomials, using this rule and the addition rule lets you dif- 
ferentiate any polynomial. A relatively simple proof for this can be derived from the binomial 
expansion theorem. 

This rule also applies to fractional and negative powers. Therefore 

{| 

j*iiUv^]i = iU xl/2 ] MI = K 1 / 2 MI^I! 
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Differentiation rules 


12.5.5 Derivatives of polynomials 

With these rules in hand, you can now find the derivative of any polynomial you come across. 
Rather than write the general formula, let’s go step by step through the process. 

— [6x 5 + 3x 2 + 3x + l] 
dx 

The first thing we can do is to use the addition rule to split the equation up into terms: 

d r C, d r 9 T d d 

We can immediately use the linear and constant rules to get rid of some terms: 

-j— [6x 5 ] + [3x 2 ] + 3 + 0. 

dx dx 

Now you may use the constant multiplier rule to move the constants outside the derivatives: 


6^-[x 5 ]+3^-[x 2 ]+3. 
dx dx 

Then use the power rule to work with the individual monomials: 


6 (5x 4 ) + 3 (2x) + 3. 

And then do some algebra to get the final answer: 

30x 4 + 6x + 3. 

These are not the only differentiation rules. There are other, more advanced, differentiation 
rules 3 , which will be described in a later chapter. 

12.5.6 Exercises 

• Find the derivatives of the following equations: 

/(x) = 42 


/(x) = 6x+ 10 


3 http://en.wikibooks.org/wiki/ Calculus %2FMore_D i fferent i at ion_Rules 
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Differentiation 


/(x) = 2x z + 12x + 3 

• Use the definition of a derivative to prove the derivative of a constant function, of a linear 
function, and the constant rule and addition or subtraction rules. 

• Answers: 

fix) = 0 


/'(*)= 6 


fix) =4x+12 
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13 Product and Quotient Rules 


13.1 Product Rule 

When we wish to differentiate a more complicated expression such as: 

h(x) = ( x 2 + 5x + 7) • (x 3 + 2x - 4) 

our only way (up to this point) to differentiate the expression is to expand it and get a polynomial, 
and then differentiate that polynomial. This method becomes very complicated and is particu- 
larly error prone when doing calculations by hand. A beginner might guess that the derivative of 
a product is the product of the derivatives, similar to the sum and difference rules, but this is not 
true. To take the derivative of a product, we use the product rule. 


Derivatives of products (Product rule) 

^ [/(x) • g(x)] = fix) ■ g'(x) + fix ) • g(x) 


13.1.1 Proof 

Proving this rule is relatively straightforward, first let us state the equation for the derivative: 

d , . .. f ix + h) ■ gix + h) - f (x) • g(x) 

— /(x) • g(x) = hm 5 — 

ax h — >o h 

We will then apply one of the oldest tricks in the book — adding a term that cancels itself out to 
the middle: 


j fix + h ) • gjx + h)- f(x + h) ■ g(x) + f(x + h) • g(x) - fix ) • g(x) 

h — *0 h 

Notice that those terms sum to zero, and so all we have done is add 0 to the equation. Now we 
can split the equation up into forms that we already know how to solve: 


T x I/W'SW] 


d_ 

dx 


[fix) -g(x)] = lim 
h — *0 


fix+h)- gix + h) - fix + h) ■ gix) fix+h)- g(x) - fix) ■ g(x) 
h + h 
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Product and Quotient Rules 


Looking at this, we see that we can separate the common terms out of the numerators to get: 


~T [fix) -gix)] = lim 
dx L 1 h -~ o 


nx + h) glx+h) - gW +g(x) nx+h) ~ m 


Which, when we take the limit, becomes: 


[fix) ■ g(JC)] = fix) ■ g'ix) + gix) ■ fix) 

, or the mnemonic "one D-two plus two D-one" 

This can be extended to 3 functions: 

4~[ fgh\ = f ix) gix) h'ix) + fix) g'ix) hix) + fix)gix)hix) 
dx 

For any number of functions, the derivative of their product is the sum, for each function, of its 
derivative times each other function. 

Back to our original example ofaproduct, hix) = (x 2 + 5x+7)-Cx 3 + 2x-4), we find the derivative 
by the product rule is 


h' ix) = ix 2 + 5x + 7) (3x 2 + 2) + (2x + 5) (x 3 + 2x - 4) = 5x 4 + 20x 3 + 27x 2 + 12x - 6 
Note, its derivative would not be 


(2x + 5) • (3x 2 + 2) = 6x 3 + 15x 2 + 4x + 10 

which is what you would get if you assumed the derivative of a product is the product of the 
derivatives. 

To apply the product rule we multiply the first function by the derivative of the second and add 
to that the derivative of first function multiply by the second function. Sometimes it helps to 
remember the memorize the phrase "First times the derivative of the second plus the second 
times the derivative of the first." 


13.2 Application, proof of the power rule 

The product rule can be used to give a proof of the power rule for whole numbers. The proof 
proceeds by mathematical induction 1 . We begin with the base case n = 1. If /i(x) = x then 
from the definition is easy to see that 


1 http://en.wikipedia.org/wiki /Mat hem at ical_induction 
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Examples 


„ , , x+h- 

f (x) = lim — 

h — >0 fi 


= 1 


Next we suppose that for fixed value of N, we know that for /at(x) = x N , f' N (x) = Nx n 1 . Consider 
the derivative of fy+i (x) = x"' v+1 , 


/jv+ i(x) = ix-x N y = (xl'x^ + x-ix^)' - x N + x-N-x n 1 = {N+ 1)x n . 

We have shown that the statement f' n (x) = n-x n ~ l is true for n = 1 and that if this statement holds 
for n = N, then it also holds for n = N+ 1. Thus by the principle of mathematical induction, the 
statement must hold for n = 1,2, 


13.3 Quotient rule 

There is a similar rule for quotients. To prove it, we go to the definition of the derivative: 


d f{x) 
dx g(x) 


lim 

h—> 0 


f (x+ h) _ f(x ) 
g(x+h) g(x) 

h 


lim fix + h)gix)-fix)gix + h) 
h^o hgix)g[x+h) 

lim /(x+fi)g(x)-/(x)g(x) + fix)gix)-fix)g{x+h) 
h^o hg[x)gix+ h) 


lim 

h—> 0 




f(x+h)-f{x ) 
h 


-fix) 


g(x+h)-g{x) 

h 


g(x)/'(x) ■ 


gix)gix+h) 

-/(x)g'(x) 


gix) 2 


This leads us to the so-called "quotient rule": 


Derivatives of quotients (Quotient Rule) 


d fix) 
dx g(x) 


gjx)fjx)- fjx)g'jx) 
gix) 2 


Which some people remember with the mnemonic "low D-high minus high D-low (over) square 
the low and away we go!" 


13.4 Examples 


The derivative of (4x - 2) / (x 2 + 1) is: 
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Product and Quotient Rules 


d (4x - 2) 
dx x 2 + 1 


(x 2 + 1) (4) - (4x — 2) (2x) 
(x 2 + 1) 2 

(4x 2 + 4) - (8x 2 - 4x) 

(x 2 + 1) 2 
-4x 2 + 4x + 4 
(x 2 + 1) 2 


Remember: the derivative of a product/quotient is not the product/quotient of the derivatives. 
(That is, differentiation does not distribute over multiplication or division.) However one can 
distribute before taking the derivative. That is ^ ((a + b) x (c + d)) = jjj ( ac + ad + bc + bd ) 
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14 Derivatives of Trigonometric Functions 


Sine, Cosine, Tangent, Cosecant, Secant, Cotangent. These are functions that crop up continu- 
ously in mathematics and engineering and have a lot of practical applications. They also appear 
in more advanced mathematics, particularly when dealing with things such as line integrals with 
complex numbers and alternate representations of space like spherical and cylindrical coordi- 
nate systems. 

We use the definition of the derivative, i.e., 


fix) = lim 

h — *0 


fix+h)- fix) 
h 


to work these first two out. 

Let us find the derivative of sin x, using the above definition. 
f{x) = sinx 

fix) = lim, 7 _ 0 sin(x+ ^~ sinx Definition of derivative 

= lim^o cos M sin(/i)+cos(/i) sin(x)-s;n.(x) trigonometric identity 
= lim/^o cus( - Y)sin(;,)+( / c ; os(,,l ~ 1)sin( - Y) factoring 

= lim^o C0S(x) / n(,i) +lim^o Tos(/i)-l) sin(x) separation of terms 

= cosx x 1 + sinx x 0 application of limit 
= cosx solution 


Now for the case of cos x 
fix) = cosx 

fix) = lim /; _ 0 cos (*+fti-cosx Definition of derivative 
= lim^o cos M cos(fo) -sin(/») sin(x)-cos(x) trigonome tric identity 
= lim^o cos ^Kcos(/i)-l)-sin M sin(/,) factoring 

= lim^o C0SW(C / °S (, |)- 1) -lim^o sin(x) ; f n(/i) separation of terms 
= cos x x 0 - sin x x 1 application of limit 
= - sinx solution 


Therefore we have established 
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Derivative of Sine and Cosine 


— sin(x) = cos(x) 
dx 


— cos(x) = -sin(x) 
ax 


To find the derivative of the tangent, we just remember that: 


tan(x) = 


sin(x) 

cos(x) 


which is a quotient. Applying the quotient rule, we get: 

- COS 2 (x)+sin 2 (x) 

dx tan(xj - cos 2 (x) 


Then, remembering that cos 2 (x) + sin 2 (x) = 1, we simplify: 


cos 2 (x)+sin 2 (x) 
cos 2 (x) 


_ 1 

cos 2 (x) 

= sec 2 (x) 


Derivative of the Tangent 


U o 

— tan(x) = sec (x) 
ax 


For secants, we just need to apply the chain rule to the derivations we have already determined. 


1 

sec(x) = 

cos(x) 


So for the secant, we state the equation as: 


1 

sec(x) = — 
u 


u(x) = cos(x) 

Take the derivative of both equations, we find: 


d -1 

— sec(x) = — r 
dx u A 


du 

dx 


110 



Examples 


du 

— = - sin(x) 
dx 


Leaving us with: 


d 

— sec(x) = 
dx 


sin(x) 

cos 2 (x) 


Simplifying, we get: 


Derivative of the Secant 


— sec(x) = sec(x) tan(x) 
dx 


Using the same procedure on cosecants: 


1 

csc(x) = 

sin(x) 


We get: 


Derivative of the Cosecant 


d 

— csc(x) = - csc(x) cot(x) 
dx 


Using the same procedure for the cotangent that we used for the tangent, we get: 


Derivative of the Cotangent 


d o 

— cot(x) = - CSC (x) 
dx 
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15 Chain Rule 


The chain rule is a method to compute the derivative of the functional composition of two or 
more functions. 

If a function, f depends on a variable, u, which in turn depends on another variable, x, that is /= 
y(u(x)) , then the rate of change of /with respect to x can be computed as the rate of change of y 
with respect to u multiplied by the rate of change of u with respect to x. 


Chain Rule 

If a function f is composed to two differentiable functions y(x) and u(x), so that f(x) = 
y(u(x)), then f(x) is differentiable and, 


df _ dy du 
dx du dx 


The method is called the " chain rule" because it can be applied sequentially to as many functions 
as are nested inside one another. 1 For example, if f is a function of g which is in turn a function 
of h, which is in turn a function of x, that is 


/(g(h(x))), 


the derivative of f with respect to x is given by 


df _ df dg dh 
dx dg dh dx 


and so on. 


The chain rule has broad applications in Physics, Chemistry, and Engineering, as well as being 
used to study related rates in many disciplines. For example, if a skydiver is falling at 60 miles per 
hour and the barometric pressure increases at 3 psi per mile fallen, then the skydiver experiences 
an increasing pressure at a rate of 60 mi/hr * 3 psi/mi = 180 psi per hour. The chain rule can also 
be generalized to multiple variables in cases where the nested functions depend on more than 
one variable. 


15.1 Examples 

15.1.1 Example I 

Suppose that a mountain climber ascends at a rate of 0.5 kilometer per hour. The temperature 
is lower at higher elevations; suppose the rate by which it decreases is 6 °C per kilometer. To 

1 http://www.math.brown.edu/help/derivtips.html 
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calculate the decrease in air temperature per hour that the climber experiences, one multiplies 
6 °C per kilometer by 0.5 kilometer per hour, to obtain 3 °C per hour. This calculation is a typical 
chain rule application. 


15.1.2 Example II 


Consider the function fx) = (x 2 + l) 3 . It follows from the chain rule that 


/(x) = (x 2 + l) 3 
u(x) = x 2 + 1 
fix) = [u(x)] 3 

df _ df du 
dx du dx 

dx du u dx^ x ' 

=3lf ■ 2x 
^ = 3(x 2 + l) 2 -2x 
= 6x(x 2 + l) 2 


Function to differentiate 
Dehne u(x) as inside function 
Express f(x) in terms of u(x) 

Express chain rule applicable here 
Substitute in f(u) and u(x) 

Compute derivatives with power rule 
Substitute u(x) back in terms of x 
Simplify. 
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0 12 3 4 

u 



x 

Figure 27: Top: Inside function with du/dx = 2 at x = 1. Middle: Outside function with df/du = 12 
at u(l) = 2. Bottom: Composite function with df/dx = 2*12 = 24 at x = 1. 


15.1.3 Example III 

In order to differentiate the trigonometric function 


fix) = sin(x 2 ), 
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one can write: 

fix) - sin(x 2 ) 
u(x) = x 2 
fix) = sin(w) 
df _ df du 
dx du dx 

^=d_ s iniu)--^ix 2 ) 
^ = cos(u) -2x 
^ = cos(x 2 )(2x) 


Function to differentiate 
Define u(x) as inside function 
Express f(x) in terms of u(x) 
Express chain rule applicable here 
Substitute in f(u) and u(x) 

Evaluate derivatives 
Substitute u in terms of x. 


15.1.4 Example IV Three Nested Functions 


The method is called the " chain rule " because it can be applied sequentially to as many functions 
as are nested inside one another. For example, if figihix))) = e sm(x \ sequential application of 
the chain rule yields the derivative as follows: 


fix) = e sin[x2) = e g 
hix) = x 2 

g(x) = sin(h) = sin(x 2 ) 
df _ dj_ d£ dh 
dx dg dh dx 
df - p g - pSin(x 2 ) 
dg~ e ~ e 


= cos ih) = cos (x 2 ) 

dh _ 9 v 
dx 

_^ e sin(x) =e sin(x). cos(x 2 


•2x 


Original (outermost) function 
Dehne h(x) as innermost function 
g(h) = sin(h) as middle function 
Express chain rule applicable here 
Differentiate f(g) 

Differentiate g(h) 

Differentiate h(x) 

Substitute into chain rule. 


1 5.2 Chain rule in Physics 

Because one physical quantity often depends on another, which, in turn depends on others, 
the chain rule has broad applications in physics. This section presents examples of the chain 
rule in kinematics and simple harmonic motion, the chain rule is also useful in electromagnetic 
induction. 
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1 5.2. 1 Physics Example I: Relative Kinematics of Two Vehicles 


5 


4 


Q) 

E 

^ 2 

o 

c 

• 4 — • 

CO 

"O 1 

0 


-1 

-1 0 1 2 3 4 5 

distance (miles) 

Figure 28: One vehicle is headed North and currently located at (0,3); the other vehicle is 
headed West and currently located at (4,0). The chain rule can be used to find whether they are 
getting closer or further apart. 



For example, one can consider the kinematics problem where one vehicle is heading West toward 
an intersection at 80 miles per hour while another is heading North away from the intersection 
at 60 miles per hour. One can ask whether the vehicles are getting closer or further apart and at 
what rate at the moment when the North bound vehicle is 3 miles North of the intersection and 
the West bound vehicle is 4 miles East of the intersection. 

Big idea: use chain rule to compute rate of change of distance between two vehicles. 

Plan: 

1. Choose coordinate system 
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2. Identify variables 

3. Draw picture 

4. Big idea: use chain rule to compute rate of change of distance between two vehicles 

5. Express c in terms of x and y via Pythagorean theorem 

6. Express dc/dt using chain rule in terms of dx/ dt and dy/dt 

7. Substitute in x, y, dx/ dt, dy/dt 

8. Simplify. 

Choose coordinate system: Let the y-axis point North and the x-axis point East. 

Identify variables: Define y(t) to be the distance of the vehicle heading North from the origin 
and x(t) to be the distance of the vehicle heading West from the origin. 

Express c in terms of x and y via Pythagorean theorem: 

c = (x 2 + y 2 ) 1/2 

Express dc/dt using chain rule in terms of dx/dt and dy/dt: 


dc _ d_r v 2 , ,,2'|l/2 
dt - dt (x + y > 

= i(x 2 + y 2 r 1/2 ^(x 2 + y 2 ) 


= 7j(x 2 + y 2 ) _1/2 
= i(x 2 + y 2 ) _1/2 
= x f, +yin 


^(x 2 ) + ^(y 2 )| 

2x in + 2 y d in 


Apply derivative operator to entire function 
Sum of squares is inside function 
Distribute differentiation operator 
Apply chain rule to x(t) and y(t)} 

Simplify. 


Substitute in x = 4 mi, y = 3 mi, dx/dt = -80 mi/hr, dy/dt = 60 mi/hr and Simplify 

dc Ami ■ {-80mi/hr) + 3mi • (60 )milhr 
dt sj {Ami) 2 + ( 3mi ) 2 

-320 mi 2 1 hr + 180m/ 2 1 hr 
5 mi 

-140 mi 2 /hr 
5 mi 

= -28 mil hr 

Consequently, the two vehicles are getting closer together at a rate of 28 mi/hr. 


118 



Chain rule in Physics 


15.2.2 Physics Example II: Harmonic Oscillator 



Figure 29: An undamped spring-mass system is a simple harmonic oscillator. 


If the displacement of a simple harmonic oscillator from equilibrium is given by x, and it is re- 
leased from its maximum displacement A at time t = 0, then the position at later times is given 
by 

x(f) = Acos{cot), 

where = 2 /T is the angular frequency and T is the period of oscillation. The velocity, v, being the 
first time derivative of the position can be computed with the chain rule: 


119 



Chain Rule 


v{t) = f t 
= jj- t Acos(co t) 

= Aj- t cos{cot) 

= A{-sin[a>t))-jj{(ot) 
- -Asm{a)t)-j- r {a)t) 

= -Asinio)t)o) 
vit) = —o)Asin(a) t). 


Definition of velocity in one dimension 
Substitute x(t) 

Bring constant A outside of derivative 
Differentiate outside function (cosine) 
Bring negative sign in front 
Evaluate remaining derivative 
Simplify. 


The acceleration is then the second time derivative of position, or simply dv/dt. 


ait) = % 

= ^(-mAsin(tof)) 

= -coA4- sinfinf) 

= -(l)AcOS(d) t)j- t (wt) 
= -Cl)AcOS(lOt)Q) 
ait ) = -£o 2 Acos(wf). 


Definition of acceleration in one dimension 
Substitute v(t) 

Bring constant term outside of derivative 
Differentiate outside function (sine) 
Evaluate remaining derivative 
Simplify. 


From Newton’s second law, F = ma, where F is the net force and m is the object’s mass. 


F = ma 

= mi-o) 2 AcosRnf)) 
= - mo) 2 Acosio)t ) 

F = -mo) 2 xit). 


Newton’s second law 
Substitute a(t) 
Simplify 

Substitute original x(t) 


Thus it can be seen that these results are consistent with the observation that the force on a simple 
harmonic oscillator is a negative constant times the displacement. 


1 5.3 Chain rule in Chemistry 

The chain rule has many applications in Chemistry because many equations in Chemistry describe 
how one physical quantity depends on another, which in turn depends on another. For example, 
the ideal gas law describes the relationship between pressure, volume, temperature, and number of 
moles, all of which can also depend on time. 
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15.3.1 Chemistry Example I: Ideal Gas Law 



Figure 30: Isotherms of an ideal gas. The curved lines represent the relationship between pressure 
and volume for an ideal gas at different temperatures: lines which arc further away from the origin 
(that is, lines that arc nearer to the top right-hand corner of the diagram) represent higher 
temperatures. 


Suppose a sample of n moles of an ideal gas is held in an isothermal (constant temperature, T) 
chamber with initial volume Vo. The ideal gas is compressed by a piston so that its volume changes 
at a constant rate so that V(t) = Vo - kt, where t is the time. The chain rule can be employed to find 
the time rate of change of the pressure. 2 The ideal gas law can be solved for the pressure, P to give: 


P{t) = 


nRT 

V(t)’ 


where P(t) and V(t) have been written as explicit functions of time and the other symbols are con- 
stant. Differentiating both sides yields 

= nRT— ( -5—) 

dt dt { V(t) ) ’ 

where the constant terms, n, R, and T, have been moved to the left of the derivative operator. Ap- 
plying the chain rule gives 


2 University of British Columbia, UBC Calculus Online Course Notes, Applications of the Chain Rule, 
http://www.ugrad.math.ubc.ca/coursedoc/mathlOO/notes/derivative/chainap.htmIAccessed 11/15/2010. 
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^ = nRT w(m)w = nRT ^W’ 

where the power rule has been used to differentiate 1 /V, Since V(t) = VO - kt, dV/dt = -k. Substituting 
in for V and dV/dt yields dP/dt. 

f 

To find P(t), one would integrate dP/dt. 

15.3.2 Chemistry Example II: Kinetic Theory of Gases 



Figure 3 1 : The temperature of an ideal monatomic gas is a measure of the average kinetic energy 
of its atoms. The size of helium atoms relative to their spacing is shown to scale under 1950 
atmospheres of pressure. The atoms have a certain, average speed, slowed down here two trillion 
fold from room temperature. 


A second application of the chain rule in Chemistry is finding the rate of change of the average 
molecular speed, v, in an ideal gas as the absolute temperature T, increases at a constant rate so that 
T = To + at, where To is the initial temperature and t is the tim e. 3 The kinetic theory of gases relates 
the ROOT MEAN SQUARE 4 of the molecular speed to the temperature, so that if v(t) and T(t) are 
functions of time, 


3 University of British Columbia, UBC Calculus Online Course Notes, Applications of the Chain Rule, 
http://www.ugrad.math.ubc.ca/coursedoc/mathlOO/notes/derivative/chainap.htmlAccessed 11/15/2010. 

4 http://en.wikibooks.org/wiki/root%20mean%20square 
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where R is the ideal gas constant, and M is the molecular weight. 


Differentiating both sides with respect to time yields: 



Using the chain rule to express the right side in terms of the with respect to temperature, T, and 
time, t, respectively gives 

dv _ _d_ (3 RT_\k dT 
dt ~ dT l M I dt ' 

Evaluating the derivative with respect to temperature, T, yields 

dv _ 1 r 3itr , '~| d ( 3RT } dT 
dt ~ 2 V M ) dT i M > dt ' 

Evaluating the remaining derivative with respect to T, taking the reciprocal of the negative power, 
and substituting T = To + at, produces 


dv 

dt 


-iG 


M 


~ 2 M£(T 0 + at). 


[3 R{T 0 +at) j 

Evaluating the derivative with respect to t yields 


dv _ 1 f M U 3 R 
dt 2 \3R(To + at) j M U 

which simplifies to 


1 

3 R [2 
M{To+at) J 

This can be integrated to find v(t), the rms molecular speed as a function of time. 


dv _ a 
dt ~ 2 


15.4 Polynomials 

We know how to differentiate regular polynomial functions. For example: 


(3x 3 - 6x 2 + x) = 9x 2 - 12x + 1 
ax 


f{x) = (x 2 + 5) 2 


fix) = x 4 + 10x 2 + 25 


fix) = 4x 3 + 20x 
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However, there is a useful rule known as the chain method rule. The function above (fix) = 
(x 2 + 5) 2 ) can be consolidated into two nested parts /(x) = u 2 , where u = m(x) = (x 2 + 5). Therefore: 

if 


giu) = u 2 

and 


u = m(x) = x 2 + 5 


Then: 


fix) = gimix)) 


Then 


fix) = g' imix))m! ix) 

The chain rule states that if we have a function of the form yiuix)) (i.e. y can be written as a 
function of u and u can be written as a function of x) then: 


dy dy du 
dx du dx 


Chain Rule 

If a function F(x) is composed to two differentiable functions g(x) and m(x), so that 
F(x)=g(m(x)), then F(x) is differentiable and. 


F'ix) = g' imix))m! (x) 


We can now investigate the original function: 


Therefore 


dy 

du 


= 2 u 


du 

dx 


= 2x 
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dy dy du o q 

— = — — = 2u-2x = 2{x z + 5)(2x) =4x 3 + 20x 

dx du dx 

This can be performed for more complicated equations. If we consider: 

d 

dx 

and let y = sJH and it= 1+x 2 , so that ^ and ^ = 2x, then, by applying the chain rule, we find 

that 




So, in just plain words, for the chain rule you take the normal derivative of the whole thing (make 
the exponent the coefficient, then multiply by original function but decrease the exponent by 1) then 
multiply by the derivative of the inside. 


15.5 References 

15.6 External links 

• http://calculusapplets.com/chainrule.html 
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16 High Order Derivatives 


The second derivative, or second order derivative, is the derivative of the derivative of a function. 
The derivative of the function fix) may be denoted by fix), and its double (or "second") derivative 
is denoted by fix). This is read as "f double prime of x," or "The second derivative of f(x)." 
Because the derivative of function / is defined as a function representing the slope of function /, 
the double derivative is the function representing the slope of the first derivative function. 

Furthermore, the third derivative is the derivative of the derivative of the derivative of a function, 
which can be represented by fix). This is read as "/triple prime of x", or "The third derivative 
of/x)". This can continue as long as the resulting derivative is itself differentiable, with the fourth 
derivative, the fifth derivative, and so on. Any derivative beyond the first derivative can be referred 

to as a higher order derivative. 

16.1 Notation 

Let f{x) be a function in terms of x. The following are notations for higher order derivatives. 


2nd Deriva- 
tive 

3rd Derivative 

4th Derivative 

nth Derivative 

Notes 

fix) 

fix) 

fix) 

fix) 

Probably the 
most common 
notation. 

d 2 f 

dx 2 

dfj_ 

dx 3 

d 4 f 

dx 4 

d n f 

dx' 1 

Leibniz nota- 
tion. 

£? [/M] 

ifd/M] 

£* [/w] 

£* [/<*)] 

Another form 
of Leibniz no- 
tation. 

D 2 f 

D 3 / 

D 4 / 

D n f 

Euler’s nota- 
tion. 


Warning: You should not write f n {x) to indicate the nth derivative, as this is easily confused with 
the quantity fix ) all raised to the nth power. 

The Leibniz notation, which is useful because of its precision, follows from 

d df\ _ d 2 f 
dx\dx dx 2 
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Newton’s dot notation extends to the second derivative, y, but typically no further in the applications 
where this notation is common. 


16.2 Examples 


Example 1: 


Find the third derivative of /(x) = 4x 1 2 3 4 5 + 6x 3 + 2x + 1 with respect to x. 


Repeatedly apply the POWER Rule 1 to find the derivatives. 

• f'{x) = 20x 4 + 18x 2 + 2 
. /"(x)= 80x 3 + 36x 

• = 240x 2 + 36 

Example 2: 


Find the third derivative of /(x) = 12sinx + ln(x + 2) + 2x with respect to x. 

Use the differentiation rules for EXPONENTIAL EXPRESSIONS 2 , LOGARITHMIC EXPRESSIONS 3 
and POLYNOMIALS 4 . 

• /'(x) = 12cosx+ 3^2 +2 

* /"(*) = -12sinx-^ 

. (x) = - 12 cos x + 

Applications: 

For applications of the second derivative in finding a curve’s concavity and points of inflection, see 
"Extrema and Points of Inflection 5 " and "Extreme Value Theorem 6 ". For applica- 
tions of higher order derivatives in physics, see the "KINEMATICS 7 " section. 


1 Chapter 12.5.4 on page 102 

2 http://en.wikibooks.org/wiki /Calculus %2FMore_D iffe rental i on_Rule s%23Exp onents 

3 http://en.wikibooks.org/wiki /Calculus%2FMore_D I ffe RENTAL I ON Ru L E s % 

23LOGARITHMS 

4 Chapter 12.5.4 on page 102 

5 Chapter 20 on page 149 

6 Chapter 25 on page 171 

7 Chapter 23 on page 165 
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17 Implicit Differentiation 


Implicit differentiation takes a relation and turns it into a rectangular regular equation. 


17.1 Explicit differentiation 


For example, to differentiate a function explicitly, 

x 2 + y 2 = 1 

First we can separate variables to get 


y 2 = 1 - x 2 

Taking the square root of both sides we get a function of y. 

y = + V 1 - x 2 

We can rewrite this as a fractional power as 

y = +(l-x 2 U 

Using the chain rule and simplifying we get, 



17.2 Implicit differentiation 


Using the same equation 


x 2 + y 2 = 1 

First, differentiate the individual terms of the equation: 

2x + 2yy' = 0 
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Separate the variables: 


2 yy' = —2x 


Divide both sides by 2 y, and simplify to get the same result as above: 


y = - 


2x 

2 y 


y' = 


X 

y 


17.3 Uses 

Implicit differentiation is useful when differentiating an equation that cannot be explicitly differen- 
tiated because it is impossible to isolate variables. 

For example, consider the equation. 


x 2 + xy + y 2 = 16 

Differentiate both sides of the equation (remember to use the product rule on the term xy) : 


Isolate terms with y’: 


2x + y + xy' + 2yy' = 0 


xy' + 2yy' = -2x-y 

Factor out a y’ and divide both sides by the other term: 

, -2x-v 

y = 

x + 2y 


17.4 Implicit Differentiation 

Generally, one will encounter functions expressed in explicit form, that is, y = fix) form. You might 
encounter a function that contains a mixture of different variables. Many times it is inconvenient or 
even impossible to solve for y. A good example is the function y 2 + 2yx + 3 = 5x. It is too cum- 
bersome to isolate y in this function. One can utilize implicit differentiation to find the derivative. 
To do so, consider y to be a nested function that is defined implicitly by x. You need to employ the 
chain rule whenever you take the derivative of a variable with respect to a different variable: i.e., - 4 - 
(the derivative with respect to x) of x is 1 ; 4^ of y is 
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Remember: 


dy dy du 
dx du dx 


Therefore: 


d q d q d V q 

Tx iy) = Ty iy) T^ 3y 


dy 

dx 


17.4.1 Examples 

Example 1 


xy = 1 


can be solved as: 


then differentiated: 



dy __J_ 
dx x 2 

However, using implicit differentiation it can also be differentiated like this: 


d d 

— lxy] = — [1] 
dx dx 


use the product rule: 


dy 

x—j— + y = 0 
dx 


solve for ^ : 


dy_ _ _y 
dx x 

Note that, if we substitute y=\ into ^ we end up with again. 

Example 2 
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Find the derivative of y 2 + x 2 = 25 with respect to x. 

You are seeking -A. 

Take the derivative of each side of the equation with respect to x 


d{y 2 + x 2 ) _ d{ 25) 
dx dx 

dy „ 

2 y • — — h 2x — 0 
dx 


2 y 


dx 


= -2x 


dy _ x 
dx y 


17.5 Inverse Trigonometric Functions 

Arcsine, arccosine, arctangent. These are the functions that allow you to determine the angle given 
the sine, cosine, or tangent of that angle. 

First, let us start with the arcsine such that: 


y = arcsin(x) 

To find dy/dx we first need to break this down into a form we can work with: 


x = sin(y) 


Then we can take the derivative of that: 


1 = cos(y) • 


dy 

dx 


...and solve for dy / dx: 


dy_ i_ 

dx cos(y) 

At this point we need to go back to the unit triangle. Since y is the angle and the opposite side is 
sin(y) (which is equal to x), the adjacent side is cos(y) (which is equal to the square root of 1 minus 
x 2 , based on the Pythagorean theorem), and the hypotenuse is 1. Since we have determined the 
value of cos(y) based on the unit triangle, we can substitute it back in to the above equation and get: 
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Derivative of the Arcsine 


d 1 

— arcsm(x) = , 
dx Vl-x 2 


We can use an identical procedure for the arccosine and arctangent: 


Derivative of the Arccosine 


d -1 

— arccos(x) = - 
dx Vl-x 2 


Derivative of the Arctangent 


— arctan(x) = 

dx l + x z 
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18 Derivatives of Exponential and Logarithm 
Functions 


18.1 Exponential Function 

First, we determine the derivative of e x using the definition of the derivative: 


d _ , e x+h -e x 

— e = hm 

dx /i— o h 


Then we apply some basic algebra with powers (specifically that a b + c = a b a 1 ): 


d 

dx 


e x = lim ■ 

/ 7 — »0 


e x e b — e x 


h 


Since e* does not depend on h, it is constant as h goes to 0. Thus, we can use the limit rules to move 
it to the outside, leaving us with: 


— e x = e x -lim 
dx h-> o 


e h - 1 
h 


Now, the limit can be calculated by techniques we will learn later, for example 
Calculus/Improper_Integrals#Definition L’Hopital’s rule 1 , and we will see that 


lim = 1 

fe— o h 


so that we have proved the following rule: 


Derivative of the exponential function 


d_ 

dx 


= e 


X 


1 http://en.wikibooks.org/wiki/ Calculus %2FImp rop e r_Int e gral s %2 3Def i nition%20L% 
27Hopital%27s%20rule 
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Now that we have derived a specific case, let us extend things to the general case. Assuming that a 
is a positive real constant, we wish to calculate: 


One of the oldest tricks in mathematics is to break a problem down into a form that we already know 
we can handle. Since we have already determined the derivative of e x , we will attempt to rewrite a x 
in that form. 

Using that e 111 ^ = c and that lnfa 41 ) = b • ln(a), we find that: 


a x = gX-ln (a) 


Thus, we simply apply the chain rule: 

— e xlnM = — [x-ln(a)] e x ' ln(a) = In (a)a x 
ax ax 


Derivative of the exponential function 


— a x = In (a) a x 
ax 


18.2 Logarithm Function 

Closely related to the exponentiation is the logarithm. Just as with exponents, we will derive the 
equation for a specific case first (the natural log, where the base is e), and then work to generalize it 
for any logarithm. 

First let us create a variable y such that: 


y = In (x) 

It should be noted that what we want to find is the derivative of y or ^ . 

Next we will put both sides to the power of e in an attempt to remove the logarithm from the right 
hand side: 


e y = x 

Now, applying the chain rule and the property of exponents we derived earlier, we take the derivative 
of both sides: 
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Logarithmic differentiation 


dy 

dx 


e y = i 


This leaves us with the derivative: 


dy _ J_ 
dx ey 


Substituting back our original equation of x - e v , we find that: 


Derivative of the Natural Logarithm 


d , 1 

— In (x) = - 
dx x 


If we wanted, we could go through that same process again for a generalized base, but it is easier 
just to use properties of logs and realize that: 


log fe (x) = 


ln(x) 

ln(fi) 


Since 1 / ln(fi) is a constant, we can just take it outside of the derivative: 


£ log ‘ (x)= tak£ 


ln(x) 


Which leaves us with the generalized form of: 


Derivative of the Logarithm 


d_ 

dx 


log ft (x) = 


1 

xln(fi) 


18.3 Logarithmic differentiation 

We can use the properties of the logarithm, particularly the natural log, to differentiate more dif- 
ficult functions, such a products with many terms, quotients of composed functions, or functions 
with variable or function exponents. We do this by taking the natural logarithm of both sides, 
re-arranging terms using the logarithm laws below, and then differentiating both sides implicitly, 
before multiplying through by y. 
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Derivatives of Exponential and Logarithm Functions 


lo g(f ) = log(fl") - 

log(a) -log(b) nlog(a) 


log(a) + 
log(fo) = 
log(afo) 


See the examples below. 

Example 1 

Suppose we wished to differentiate 


(6x 2 + 9) 2 
y \J 3x 3 — 2 

We take the natural logarithm of both sides 


ln(y) = In 


' (6x 2 + 9) 2 ' 
i \/3x 3 -2 > 


= ln(6x 2 + 9) 2 -ln(3x -2)2 
= 2 ln(6x 2 + 9) - - ln(3x 3 - 2) 


Differentiating implicitly 


1 dy 12x 1 9x 2 

= 2 x — x — 

y dx 6x 2 + 9 2 3x 3 -2 

24x 

6x 2 + 9 3x 3 - 2 

24x(3x 3 -2) — |x 2 (6x 2 + 9) 

(6x 2 + 9) (3x 3 — 2) 

Multiplying by y 


dy (6x 2 + 9) 2 24x(3x 3 — 2) - |x 2 (6x 2 + 9) 

dx y/3x 3 - 2 (6x 2 +9) (3x 3 -2) 


Example 2 
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Logarithmic differentiation 


Let us differentiate a function 


y = x x 

Taking the natural logarithm of left and right 

lny = ln(x x ) 

= xln(x) 

We then differentiate both sides, recalling the product rule 

1 dy 1 

— — = ln(x) + x- 
y ax x 

= ln(x) + 1 

Multiplying by the original function y 


^ =x x (ln(x) + l) 
ax 


Example 3 


Take a function 


y - x 


6cos(x) 


Then 


lny = ln(x 6cosW ) 

= 6cos(x) ln(x) 


We then differentiate 


1 dy 
y dx 


, 6cos(x) 

-6sin(x) ln(x) H 

x 


And finally multiply by y 


dy 

~dx = yl 


-6sin(x) ln(x) + 


6cos(x)' 

x , 


_ ^.6cos(x) 


, 6cos(x)' 

- 6 sin(x) ln(x) H 

x 
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Derivatives of Exponential and Logarithm Functions 


18.4 Exercises 


18.5 Find The Derivative By Definition 

Find the derivative of the following functions using the limit definition of the derivative. 

1. fix) = x 2 

2. f[x) =2x + 2 

3. fix) = \x 2 

4. /(x) =2x 2 +4x + 4 

5. fix) = v/jc + 2 

6. fix) = ± 

7 - /M = x+T 

8 - f {x]= vk 

9. fix) = ^ 

Solutions 2 


18.6 Prove Differentiation Rules 

Use the definition of the derivative to prove the following general rules: 

1. For any fixed real number c, -4- [c] = 0. 

2. For any fixed real numbers m and c, 4- [mx + c] = m 

3. For any fixed real number c, 4- [c/(x)] = cfj- y [/(x)]</div> 

4 - ^[/(x)±g(x)] = ^[/(x)]±^[g(x)] 

Solutions 3 


18.7 Find The Derivative By Rules 

Find the derivative of the following functions: 


18.7.1 Power Rule 

1. / (x) = 2x 2 + 4 

2. fix) = 3\fx 

3. fix) = 2x 5 + 8x 2 + x-78 

4. fix) = 7x 7 + 8x 5 + x 3 + x 2 -x 

5. fix) = jz +3x5 

6. fix) = 3x 15 + ^x 2 + ^ 


2 Chapter 62.5.1 on page 409 

3 Chapter 62.7 on page 410 
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Find The Derivative By Rules 


7. /(x) = J r -</x+x 

8. /(x) = 6x 1/3 - x 0 ' 4 + ^2 

9 - fW = ~^ + Vx 


18.7.2 Product Rule 

1 . /(x) = (x 4 + 4x + 2) (2x + 3) 

2. h(x) = (2x- l)(3x 2 + 2) 

3. /(x) = (x 3 - 12x)(3x 2 +2x) 

4. /(x) = (2 x 5 -x)(3x+1) 


18.7.3 Quotient Rule 


1. 

2 . 

3. 

4. 


/M = 
fix) = 

/(x) = 
d{u) = 


2x~t 1 
x+5 

3x 4 +2x+ 2 
3x 2 + l 

3 

X2+1 

x+2 
u 3 + 2 

77^ 


5. /(x) = £±f 

6. /(x) = X+1 

7. /(x) = 


2x 2 +2x+3 

16x 4 +2x 2 


18.7.4 Chain Rule 


1. /(x) = (x + 5) 2 

2. g(x) = (x 3 -2x + 5) 2 

3. /(x) = \/l-x 2 
4- /(x) = 

5. fix) = (2x + 1) \j2x + 2 
6 ‘ /W = ^ _ 

7. /(x) = v / 2x 2 TT(3x 4 + 2x) 2 

8. /(x) = (x 4 +4x+2 )2 

9. /(x) - V x 3 + 1 (x 2 — 1) 

10. /(x) = (( 2x + 3) 4 + 4(2x + 3) + 2) 2 

11 . fix) = Vl + X 2 


18.7.5 Exponentials 

1. fix) = (3x 2 + e)e 2x 

2. fix) - e 2x2+3x 

3. fix) = e e2x2+1 

4. / (x) = 4 X 
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Derivatives of Exponential and Logarithm Functions 


18.7.6 Logarithms 

1. fix ) = 2 X_3 • 3\/ x 3 — 2 + lnx 

2. fix) = lnx-2e x + \fx 

3. f{x) = ln(ln(x 3 (x + 1))) 

4. fix) = ln(2x 2 + 3x) 

5. fix) = log 4 x + 2lnx 


18.7.7 Trig Functions 

1. fix) = 3e x -4cos(x) - \\nx 

2. fix) = sin(x) + cos(x) 

Solutions 4 


18.8 More Differentiation 


Given the above rules, practice differentiation on the following. 


3. 

4. 


1. ^[(x 3 + 5) 10 ] 

2. fjj [x 3 + 3x] 

^[(x + 4)-(x + 2)-(x- 

d r x± 1 1 

dx * 3x 2 

5. 

6. [sinx- x 4 ] 

7 - St 2 "] 

8. £.[e x l 


3)] 


dx L 
jL [e 2- r 
dx le 


9. 

10. ^[(6x) cos(x)+1 ] 


18.8.1 Answers 

1. 30x 2 (x 3 + 5) 9 

2. 3x 2 + 3 

3 . (x — 3) (x + 2) + (x + 4) (x + 2) + (x — 3) (x + 4) 


6. cos(x)x 4 + sin(x) -4x 3 

7. ln(2)-2 x 

8. 2xe* 2 

9. ln(2)-2 x e 2 ' 

10. 6x cos(x)+1 • [- sin(x) ln(x) + cos(x)+1 ] 


4 Chapter 62.8 on page 41 1 
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Implicit Differentiation 


18.9 Implicit Differentiation 

Use implicit differentiation to find y’ 

1. x 2 + y 2 = 1 

2. x 3 + y 3 = xy 

3. (2x + y) 4 + 3x 2 + 3y 2 = j + 1 
Solutions 5 


18.10 Logarithmic Differentiation 

Use logarithmic differentiation to find 

1. y = x(Vl -x 3 ) 

2. y = 

3. y = (2 x) 2x 

4. y = (x 3 +4x) 3x+1 

Solutions 6 


18.11 Equation of Tangent Line 



For each function, /, (a) determine for what values of x the tangent line to / is horizontal and (b) 
find an equation of the tangent line to / at the given point. 

1. /(x) = y + x 2 +5, (3,23) 

2. /(x) = x 3 -3x+ 1, (1,-1) 

3. /(x) = |x 3 + x 2 - 12x + 6, (0,6) 

4. /(x) = 2x+^=, (1,3) 

5. /(x) = (x 2 +1)(2-x), (2,0) 

6. /(x) = |x 3 + fx 2 + 2x+ 1, (3,/(3)) 

7. Find an equation of the tangent line to the graph defined by (x-y- 1) 3 = x at the point (1,-1). 

8. Find an equation of the tangent line to the graph defined by e xy + x 2 = y 2 at the point (1,0). 


18.12 Higher Order Derivatives 


1. What is the second derivative of 3x 4 + 3x 2 + 2x? 
Solutions 7 


5 Chapter 62. 10 on page 412 

6 Chapter 62.9 on page 412 

7 Chapter 62. 1 1 on page 412 
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Derivatives of Exponential and Logarithm Functions 


18.13 Relative Extrema 


Find the relative maximum(s) and minimum(s), if any, of the following functions. 


1 . 

2 . 

3. 

4. 

5. 

6 . 


m = ^ 

/(x) = (x- 1) 2/3 

/W = X 2 + \ 

f ^ = I+F 
fix) = x 2 -4x + 9 


fix ) = 


x 2 +x+l 
-JC+1 


18.14 Range of Function 


1 . Show that the expression x+llx cannot take on any value strictly between 2 and -2. 


18.15 Absolute Extrema 

Determine the absolute maximum and minimum of the following functions on the given domain 

1. fix) = ^x 3 - ^x 2 + 1 on [0,3] 

2. fix) = xVl - x 2 on [-1,1] 

3. fix) = (|x 2 - l)x on [-1/2,2] 


18.16 Determine Intervals of Change 


Note: There are currently no answers given for these exercises. 

Find the intervals where the following functions are increasing or decreasing 

1. 10-6x-2x 2 

2. 2x 3 -12x 2 +18x+15 

3. 5+36x+3x 2 -2x 3 

4. 8+36x+3x 2 -2x 3 

5. 5x 3 -15x 2 -120x+3 

6. x 3 -6x 2 -36x+2 


18.17 Determine Intervals of Concavity 


Find the intervals where the following functions are concave up or concave down 

1. 10-6x-2x 2 

2. 2x 3 -12x 2 +18x+15 

3. 5+36x+3x 2 -2x 3 

4. 8+36x+3x 2 -2x 3 
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Word Problems 


5. 5x 3 -15x 2 -120x+3 

6. x 3 -6x 2 -36x+2 


18.18 Word Problems 

1. You peer around a corner. A velociraptor 64 meters away spots you. You run away at a speed 
of 6 meters per second. The raptor chases, running towards the corner you just left at a speed 
of 4/ meters per second (time t measured in seconds after spotting). After you have run 4 
seconds the raptor is 32 meters from the corner. At this time, how fast is death approaching 
your soon to be mangled flesh? That is, what is the rate of change in the distance between 
you and the raptor? 

2. Two goldcarts leave an intersection at the same time. One heads north going 12 mph and the 
other heads east going 5 mph. How fast arc the cars getting away from each other after one 
hour? 

3. You’re making a can of volume 200 m 3 with a gold side and silver top/bottom. Say gold costs 
10 dollars per m 2 and silver costs 1 dollar per m 2 . What’s the minimum costs of such a can? 


18.19 Graphing Functions 


For each of the following, graph a function that abides by the provided characteristics 

1. /( 1) = /(— 2) = 0, lim^oo/A) = lim^-oo/fx) = 0, vertical asymptote at x = -3, fix) > 
0 on (0,2), /'(x) < Oon (-oo,-3) U (-3,0) u (2,oo), f"{x) > 0 on (-3,1) U (3,oo), /"(x) < 
0 on (-oo, -3) U (1,3). 

2. / has domain [-1, 1], /(- 1) = -1, /(-|) = -2, = 0, fix) > 0 on (-1, 1) 


145 



Derivatives of Exponential and Logarithm Functions 


146 



19 Applications of Derivatives 
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Applications of Derivatives 


148 



20 Extrema and Points of Inflection 



Local Maximum 


Global Maximum 


Global Minimum^ 


Local Minimum 


Figure 32: The four types of extrema. 

Maxima and minima are points where a function reaches a highest or lowest value, respectively. 
There are two kinds of extrema (a word meaning maximum or minimum): global and local, some- 
times referred to as "absolute" and "relative", respectively. A global maximum is a point that takes 
the largest value on the entire range of the function, while a global minimum is the point that takes 
the smallest value on the range of the function. On the other hand, local extrema are the largest or 
smallest values of the function in the immediate vicinity. 

All extrema look like the crest of a hill or the bottom of a bowl on a graph of the function. A global 
extremum is always a local extremum too, because it is the largest or smallest value on the entire 
range of the function, and therefore also its vicinity. It is also possible to have a function with no 
extrema, global or local: y=x is a simple example. 

At any extremum, the slope of the graph is necessarily zero, as the graph must stop rising or falling 
at an extremum, and begin to head in the opposite direction. Because of this, extrema arc also 
commonly called stationary points or turning points. Therefore, the first derivative of a function 
is equal to zero at extrema. If the graph has one or more of these STATIONARY POINTS 1 , these may 
be found by setting the first derivative equal to zero and finding the roots of the resulting equation. 


1 http://en.wikipedia.org/wiki/stationary%20point 
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Extrema and Points of Inflection 



Figure 33: The function f(x)=x 3 , which contains a point of inflexion at the point (0,0). 


Flowever, a slope of zero does not guarantee a maximum or minimum: there is a third class of 
stationary point called a POINT OF INFLEXION 2 (also spelled point of inflection). Consider the 
function 


fix ) = x 3 


The derivative is 


f (x) = 3x 2 

The slope at x=0 is 0. We have a slope of zero, but while this makes it a stationary point, this doesn’t 
mean that it is a maximum or minimum. Looking at the graph of the function you will see that x=0 
is neither, it’s just a spot at which the function flattens out. True extrema require the a sign change in 
the first derivative. This makes sense - you have to rise (positive slope) to and fall (negative slope) 
from a maximum. In between rising and falling, on a smooth curve, there will be a point of zero 
slope - the maximum. A minimum would exhibit similar properties, just in reverse. 


2 http://en.wikipedia.org/wiki/Point%20of%20inflexion 
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The Extremum Test 



Figure 34: Good ( B and C, green) and bad ( D and E, blue) points to check in order to classify the 
extremum (A black). The bad points lead to an incorrect classification of A as a minimum. 


This leads to a simple method to classify a stationary point - plug x values slightly left and right into 
the derivative of the function. If the results have opposite signs then it is a true maximum/ mi n im um. 
You can also use these slopes to figure out if it is a maximum or a minimum: the left side slope will 
be positive for a maximum and negative for a minimum. However, you must exercise caution with 
this method, as, if you pick a point too far from the extremum, you could take it on the far side of 
another extremum and incorrectly classify the point. 


20.1 The Extremum Test 

A more rigorous method to classify a stationary point is called the extremum test, or 2nd Deriva- 
tive Test. As we mentioned before, the sign of the first derivative must change for a stationary point 
to be a true extremum. Now, the second derivative of the function tells us the rate of change of the 
first derivative. It therefore follows that if the second derivative is positive at the stationary point, 
then the gradient is increasing. The fact that it is a stationary point in the first place means that this 
can only be a minimum. Conversely, if the second derivative is negative at that point, then it is a 
maximum. 

Now, if the second derivative is zero, we have a problem. It could be a point of inflexion, or it could 
still be an extremum. Examples of each of these cases are below - all have a second derivative equal 
to zero at the stationary point in question: 

• y = x 3 has a point of inflexion at x = 0 

• y = x 4 has a minimum at x = 0 

• y = -x 4 has a maximum at x = 0 

However, this is not an insoluble problem. What we must do is continue to differentiate until we 
get, at the (n+1 )th derivative, a non- zero result at the stationary point: 

f (x) = 0, f" (x) = 0, . . . , f [n) (x) = 0, / ( " +1) (x) ? 0 
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Extrema and Points of Inflection 


If n is odd, then the stationary point is a true extremum. If the (n+1 ) th derivative is positive, it is 
a minimum; if the (n+1 ) th derivative is negative, it is a maximum. If n is even, then the stationary 
point is a point of inflexion. 

As an example, let us consider the function 


fix ) = -x 4 

We now differentiate until we get a non-zero result at the stationary point at x-0 (assume we have 
already found this point as usual): 


f (x) = — 4x 3 


f" (x) - -12x 2 


/'" (x) = — 24x 


/ (4) (x) = -24 

Therefore, (n+1) is 4, so n is 3. This is odd, and the fourth derivative is negative, so we have a 
maximum. Note that none of the methods given can tell you if this is a global extremum or just a 
local one. To do this, you would have to set the function equal to the height of the extremum and 
look for other roots. 

See "Optimization 3 " for a common application of these principles. 


3 Chapter 24 on page 167 
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21 Newton’s Method 


Newton’s Method (also called the Newton-Raphson method) is a recursive algorithm for approx- 
imating the root of a differentiable function. We know simple formulas for finding the roots of 
lineal - and quadratic equations, and there are also more complicated formulae for cubic and quartic 
equations. At one time it was hoped that there would be formulas found for equations of quintic and 
higher-degree, though it was later shown by Neils Henrik Abel 1 that no such equations exist. 
The Newton-Raphson method is a method for approximating the roots of polynomial equations of 
any order. In fact the method works for any equation, polynomial or not, as long as the function is 
differentiable in a desired interval. 

ctrxtd style= "background-color: #FFFFFF; border: solid lpx#D6D6FF; padding: lem; margin: 
Opx auto;" valign=top> 


Newton’s Method 


Let fix ) be a differentiable function. Select a point x\ based on a first approximation to the root, 
arbitrarily close to the function’s root. To approximate the root you then recursively calculate using: 

Xn+1 — Xri 

f'ixn) 

As you recursively calculate, the x„’s become increasingly better approximations of the function’s 
root. 

For n number of approximations. 


x n — 


n 


*o -L 

i .= 0 


f(Xj) 

f'(Xi) 


In order to explain Newton’s method, imagine that xo is already very close to a zero of fix). We 
know that if we only look at points very close to xo then fix) looks like it’s tangent line. If xo was 
already close to the place where fix) was zero, and near xo we know that fix) looks like its tangent 
line, then we hope the zero of the tangent line at xo is a better approximation then xo itself. 

The equation for the tangent line to fix) at xo is given by 


y = /'(xo)(x-x 0 ) + /(x 0 ). 


1 http://en.wikipedia.org/wiki /Ni els%20Henri k%20Abel 
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Newton’s Method 


Now we set y = 0 and solve for x. 


0 = /'(Xo)(X-Xo) + f{x o) 


-fix o) = fix o)(X-Xo) 


-fix o) 
fix o) 


= (X-X 0 ) 


-fix o) 

X = — h Xo 

fix o) 


This value of x we feel should be a better guess for the value of x where fix) = 0. We choose to call 
this value of Xi , and a little algebra we have 


Xi = x 0 - 


fix o) 
fix o)' 


If our intuition was coiTect and x\ is in fact a better approximation for the root of fix), then our 
logic should apply equally well at xi. We could look to the place where the tangent line at xi is 
zero. We call X 2 , following the algebra above we arrive at the formula 


X2 = Xi - 


fix l) 
fiXif 


And we can continue in this way as long as we wish. At each step, if your current approximation is 
x n our new approximation will be x n+ i = x n - ■ 


21.1 Examples 


Find the root of the function f(x) = x 2 . 


X! = /( 2) = 4 


X 2 = Xi - 


fix id 

fixi) 


2 


X 3 = X 2 - 


f(X2) 

fix 2 ) 


1 


X 4 = x 3 - 


fix 3 ) 

fix 3 ) 


1 

2 
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Notes 


X5 = X4 - 


fix 4 ) 
/'(x 4 ) 


1 

4 


x 6 = x 5 - 


fix 5) 
/'(x 5 ) 


1 

8 


X7 = X6 - 


/Ug) 

/'(x 6 ) 


1 

16 


x 8 = x 7 - 


/(x 7 ) 

/'(*?) 


1 

32 


As you can see x n is gradually approaching zero (which we know is the root of f(x)). One can 
approach the function’s root with arbitrary accuracy. 


Answer: /(x) = x 2 has a root at x = 0 . 


21.2 Notes 

This method fails when fix) = 0. In that case, one should choose a new stalling place. Occasionally 
it may happen that fix ) = 0 and fix) = 0 have a common root. To detect whether this is true, we 
should first find the solutions of fix) = 0 , and then check the value of fix) at these places. 

Newton’s method also may not converge for every function, take as an example: 


fix) = 


y/x- r, forx > r 
-y/r-x, forx<r 


For this function choosing any x\ = r-h then X 2 = r + h would cause successive approximations to 
alternate back and forth, so no amount of iteration would get us any closer to the root than our first 
guess. 


21.3 See Also 

• Wikipedia:Newton’s METHOD 2 

• Wikipedia:Abel-Ruffini theorem 3 

category: Newton Method 4 


2 http ://en. wikipedia.org/wiki/Newton%27s%20method 

3 http ://en. wikipedia.org/wiki/Abel%20%13Ruffini%20theorem 

4 http://commons.wikimedia.org/wiki/category: NewtonMethod 
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Newton’s Method 
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22 Related Rates 


22.1 Introduction 

Process for solving related rates problems: 

• Write out any relevant formulas and information. 

• Take the derivative of the primary equation with respect to time. 

• Solve for the desired variable. 

• Plug-in known information and simplify. 


22.2 Related Rates 

As stated in the introduction, when doing related rates, you generate a function which compares 
the rate of change of one value with respect to change in time. For example, velocity is the rate 
of change of distance over time. Likewise, acceleration is the rate of change of velocity over time. 
Therefore, for the variables for distance, velocity, and acceleration, respectively x, v, and a, and 
time, t: 


dx 

dt 


dv 
a = — 
dt 

Using derivatives, you can find the functions for velocity and acceleration from the distance func- 
tion. This is the basic idea behind related rates: the rate of change of a function is the derivative of 
that function with respect to time. 


22.3 Common Applications 

22.3.1 Filling Tank 

This is the easiest variant of the most common textbook related rates problem: the filling water tank. 

• The tank is a cube, with volume 1000L. 

• You have to fill the tank in ten minutes or you die. 

• You want to escape with your life and as much money as possible, so you want to find the smallest 
pump that can finish the task. 
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Related Rates 


We need a pump that will fill the tank 1000L in ten minutes. So, for pump rate p, volume of water 
pumped v, and minutes t: 



22.4 Examples 


Related rates can get complicated very easily. 

Example 1: 

A cone with a circular base is being tilled with water. Find a formula which will find the rate with 
which water is pumped. 

• Write out any relevant formulas or pieces of information. 

V = -nr 2 h 
3 

• Take the derivative of the equation above with respect to time. Remember to use the Chain 
Rule 1 and the Product Rule 2 . 


1 2 , 

V = -nr 2 h 
3 

dV n , dh dr\ 

— — = - r 2 - — + 2rh- — \ 
dt 3 V dt dt) 

Answer: = f (>' 2 -^+2 rh-%) 

Example 2: 

A spherical hot air balloon is being filled with air. The volume is changing at a rate of 2 cubic feet 
per minute. 

How is the radius changing with respect to time when the radius is equal to 2 feet? 

• Write out any relevant formulas and pieces of information. 


V sphere 



dV 

dt 


= 2 


1 http://en.wikibooks.org/wiki /Calculus %2FMore_D iffe rental i on_Rule s%23Chai n% 
20Rule 

2 http://en.wikibooks.org/wiki /Calculus %2FMore_D i fferent i at ion_Rule s%2 3Product% 
20Rule 
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Examples 


r = 2 

• Take the derivative of both sides of the volume equation with respect to time. 



If \=\r^nr 2 -%\-\\=\^-%\-\\} 

• Solve for 


dr 1 dV 
dt Anr 2 dt 

• Plug-in known information. 


dr 1 

— = 2 

dt 16 n 


Answer: % = ± ft/min. 

Example 3: 

An airplane is attempting to drop a box onto a house. The house is 300 feet away in horizontal 
distance and 400 feet in vertical distance. The rate of change of the horizontal distance with respect 
to time is the same as the rate of change of the vertical distance with respect to time. How is the 
distance between the box and the house changing with respect to time at the moment? The rate of 
change in the horizontal direction with respect to time is -50 feet per second. 

Note: Because the vertical distance is downward in nature, the rate of change ofy is negative. Sim- 
ilarly, the horizontal distance is decreasing, therefore it is negative (it is getting closer and closer). 

The easiest way to describe the horizontal and vertical relationships of the plane’s motion is the 
Pythagorean Theorem. 

• Write out any relevant formulas and pieces of information. 


(where 5 is the distance between the plane and the house) 

x = 300 


y - 400 
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= Jx 2 + y 2 = V 300 2 + 400 2 = 500 


dx dy 
— = -f- = -50 
cit ctt 

Take the derivative of both sides of the distance formula with respect to time. 


2 2 2 
jr + jr = 


Solve for 


Plug-in known information 


dx dy ds 

2x v2y -2 s 

d t dt dt 


ds _ 1 
dt 2s 


1 dx dy 
2x • — — 1 2 y • — — 

,7 4- •' *1-4- 


dt 


dt 


l= l^o) [2(300) -(- 5 °) + 2(400) • (-50)] I- I 1= 1^1-70000) I- I 1= 1-70 ft/s I- I 1} 

Answer: jf t = -70 ft/sec. 

Example 4: 

Sand falls onto a cone shaped pile at a rate of 10 cubic feet per minute. The radius of the pile’s base 
is always 1/2 of its altitude. When the pile is 5 ft deep, how fast is the altitude of the pile increasing? 

• Write down any relevant formulas and information. 



h 



r = 


-h 

2 


h = 5 


Substitute r = into the volume equation. 
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{I 

\V 1= \^nr 2 h I- I 1= \^nh- I- 1 1= l-^nh 3 1-11} 

• Take the derivative of the volume equation with respect to time. 


V - — nh 3 
12 


d V 1 


dh 


— — = -nh ■ — — 


dt 


dt 


Solve for 


dh 

dt 


• Plug-in known information and simplify. 


4 dV 
nh 2 dt 


'Sl=lsSF-10l-ll=|Aft/minl-"l 
Answer: ^7 = ^ ft/min. 

Example 5: 

A 10 ft long ladder is leaning against a vertical wall. The foot of the ladder is being pulled away 
from the wall at a constant rate of 2 ft/sec. When the ladder is exactly 8 ft from the wall, how fast is 
the top of the ladder sliding down the wall? 

• Write out any relevant formulas and information. 

Use the Pythagorean Theorem to describe the motion of the ladder. 

x 2 + y 2 - l 2 

(where l is the length of the ladder) 


l = 10 


dx 

dt 


x = 8 


y=Vl 2 ~x 2 = v 7 100 — 64 = \/36 = 6 
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• Take the derivative of the equation with respect to time. 


( 


dx „ dy n 
2x • — — i- 2 y • — — — 0 
dt dt 


l 2 


so 

•) 

• S ° lvefor s- 



dx dy 

2x ■ — — + 2y • — = 0 
dt dt 


dy dx 

2 y -ir- 2x u 


dy _ x dx 
dt y dt 


• Plug-in known information and simplify. 


lgl=l(— 1)(2) I- 1 1=1- 1 ft/sec 1-11} 
Answer: ^ = - 1 ft/sec. 


22.5 Exercises 

22.5.1 Problem Set 

Here’s a few problems for you to try: 

1 . A spherical balloon is inflated at a rate of 100 /rVmin. Assuming the rate of inflation remains 
constant, how fast is the radius of the balloon increasing at the instant the radius is 4 ft? 

2. Water is pumped from a cone shaped reservoir (the vertex is pointed down) 10 ft in diameter 
and 10 ft deep at a constant rate of 3 f t'Vmin. How fast is the water level falling when the 
depth of the water is 6 ft? 

3. A boat is pulled into a dock via a rope with one end attached to the bow of a boat and the 
other end held by a man standing 6 ft above the bow of the boat. If the man pulls the rope at 
a constant rate of 2 ft/sec, how fast is the boat moving toward the dock when 10 ft of rope is 
out? 
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22.5.2 Solution Set 


1 25 ft 

‘ 16;r min 

2 1 

‘ 3 n min 

o 5 ft_ 

‘ 2 sec 
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23 Kinematics 


23.1 Introduction 

Kinematics or the study of motion is a very relevant topic in calculus. 

If xis the position of some moving object, and t is time, this section uses the following conventions: 

• x(f) is its position function 

• v{t) = x'[t) is its VELOCITY 1 function 

• a(t) = x"{t) is its ACCELERATION 2 function 


23.2 Differentiation 

23.2. 1 Average Velocity and Acceleration 

Average velocity and acceleration problems use the algebraic definitions of velocity and accelera- 
tion. 

u avg - Af 
u avg ~ Af 

Examples 
Example 1: 


A particle's position is defined by the equation = f 5 — 2f 2 + t . Find the 
average velocity over the interval [2,7]. 


• Find the average velocity over the interval [2, 7]\ 


I V avg 1= \ Xi7) 7 ~_f 2) I- I 1= I- I 1= 150 I- I 1} 


Answer: u aV g = 50 . 


1 http ://en. wikipedia.org/wiki/velocity 

2 http ://en. wikipedia.org/wiki/acceleration 
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Kinematics 


23.2.2 Instantaneous Velocity and Acceleration 


Instantaneous velocity and acceleration problems use the derivative definitions of velocity and ac- 
celeration. 


aW = % 


Examples 
Example 2: 


A particle moves along a path with a position that can be determined 
by the function x[t) = 4t 3 + e t . 

Determine the acceleration when t= 3. 


• Find v{t) = 


Find all ) = % = %. 


• Find a{ 3 ) = j^\ t=3 


d s o f 
— = 12 t 2 + e l 
dt 


d 2 s 

dt 2 


= 24f + e 


|^|| f=3 |= |24(3) + e 3 I- 1 1= 172 + e 3 1-11=192.08553692... 1-11} 


Answer: a{3) = 92.08553692... 


23.3 Integration 


• x 2 -X]= f t ^ v{t)dt 

• v 2 -v\ = a{t)dt 
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24 Optimization 


24.1 Introduction 

Optimization is one of the uses of Calculus in the real world. Let us assume we are a pizza parlor 
and wish to maximize profit. Perhaps we have a flat piece of cardboard and we need to make a box 
with the greatest volume. How does one go about this process? 

Obviously, this requires the use of maximums and minimums. We know that we find maximums 
and minimums via derivatives. Therefore, one can conclude that Calculus will be a useful tool for 
maximizing or minimizing (also known as "Optimizing") a situation. 


24.2 Examples 

24.2.1 Volume Example 

A box manufacturer desires to create a box with a surface area of 100 inches squared. What is the 
maximum size volume that can be formed by bending this material into a box? The box is to be 
closed. The box is to have a square base, square top, and rectangular sides. 

• Write out known formulas and information 


Abase ~ X 


Aside — X' h 


Atotal - 2x 2 + 4x • /r = 100 


V = l- w-h = x 2 -h 

• Eliminate the variable h in the volume equation 

2x 2 + 4xh = 100 


x 2 + 2 xh = 50 
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Examples 


24.2.2 Sales Example 



Figure 35 


A small retailer can sell n units of a product for a revenue of r(n) = 8.1 n and at a cost of cin) = 
n ' -7 n 2 + 18n, with all amounts in thousands. Flow many units does it sell to maximize its profit? 

The retailer’s profit is defined by the equation pin) = r(n) - cin), which is the revenue generated 
less the cost. The question asks for the maximum amount of profit which is the maximum of the 
above equation. As previously discussed, the maxima and minima of a graph are found when the 
slope of said graph is equal to zero. To find the slope one finds the derivative of pin). By using the 
subtraction rule p’in) = r'iri) - c'{n): 

pin) =rin)-cin) 

p'in) =d_ [8An] _d_[ n 3_ 7n 2 + 18n] 

= -3 n 2 + 14n-9.9 
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Therefore, when -3 n 2 + 14 n - 9.9 =0 the profit will be maximized or minimized. Use the 
QUADRATIC FORMULA 1 to find the roots, giving {3.798,0.869}. To find which of these is the 
maximum and minimum the function can be tested: 

p{ 0.869) = -3.97321, p(3.798) = 8.58802 

Because we only consider the functions for all n > 0 (i.e., you can’t have n = — 5 units), the only 
points that can be minima or maxima are those two listed above. To show that 3.798 is in fact a 
maximum (and that the function doesn’t remain constant past this point) check if the sign of p'{n ) 
changes at this point. It does, and for n greater than 3.798 P'{n) the value will remain decreasing. 
Finally, this shows that for this retailer selling 3.798 units would return a profit of $8,588.02. 


1 http://en.wikipedia.org/wiki/quadratic%20formula 
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25 Extreme Value Theorem 


Extreme Value Theorem 

If /is a continuous function and closed on the interval [a, b], then / has both a minimum and a 
maximum. 

This introduces us to the aspect of global extrema and local extrema. (Also known as absolute 
extrema or relative extrema respectively.) 

How is this so? Let us use an example. 

fix ) = x 2 and is closed on the interval [-1,2], Find all extrema. 



A critical point (a point where the derivative is zero) exists at (0,0). Just for practice, let us use the 
second derivative test to evaluate whether or not it is a minimum or maximum. (You should know 
it is a minimum from looking at the graph.) 


d 2 y 

— - = 2 
dx 2 

f"{c) > 0, thus it must be a minimum. 

As mentioned before, one can find global extrema on a closed interval. How? Evaluate the y 
coordinate at the endpoints of the interval and compare it to the y coordinates of the critical point. 
When you are finding extrema on a closed interval it is called a local extremum and when it’s for 
the whole graph it’s called a global extremum. 

1: Critical Point: (0,0) This is the lowest value in the interval. Therefore, it is a local minimum 
which also happens to be the global minimum. 

2: Left Endpoint (-1, 1) This point is not a critical point nor is it the highest/lowest value, therefore 
it qualities as nothing. 

3: Right Endpoint (2, 4) This is the highest value in the interval, and thus it is a local maximum. 

This example was to show you the extreme value theorem. The quintessential point is this: on a 
closed interval, the function will have both minima and maxima. However, if that interval was an 
open interval of all real numbers, (0,0) would have been a local minimum. On a closed interval, 
always remember to evaluate endpoints to obtain global extrema. 
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Extreme Value Theorem 


25.1 First Derivative Test 

Recall that the first derivative of a function describes the slope of the graph of the function at every 
point along the graph for which the function is defined and differentiable. 

Increasing/Decreasing: 

• If fix) < 0 , then fix) is decreasing. 

• If fix) > 0 , then fix) is increasing. 

Local Extrema: 

• If x=c = fic) = 0 and fix) changes signs at x = c , then there exists a local extremum at 
x= c . 

• If fix) <0 for x < c and fix') >0 for x > c , then /(c) is a local minimum. 

• If fix) >0 for x < c and fix)< 0 x > c , then /(c) is a local maximum. 


Example 1: 


Let /(x) = 3x 2 + 4x - 5 . Find all local extrema. 


Find S 


fix) = 3x 2 + 4x - 5 


Set ^ = 0 to find local extrema. 


f\x) = 6x + 4 


6x + 4 = 0 


6x = —4 


x = - 


2 

3 


• Determine whether there is a local minimum or maximum at x = 


Choose an x value smaller than 


2 

3 


//— 1) = 6(— 1) + 4 = — 2 < 0 
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Second Derivative Test 


Choose an x value larger than 


2 

3 


f'(l) = 6 ( 1 ) + 4 = 10 >0 

Therefore, there is a local minimum at x = -| because /'(- 1) = 0 and f'(x) changes signs at 


o 

Answer: local minimum: x=— 3 . 


25.2 Second Derivative Test 


Recall that the second derivative of a function describes the concavity of the graph of that function. 

d 2 v 

• If f \ x=c = f"(c) = 0 and f"{c) changes signs at x = c , then there is a point of inflection (change 
in concavity) at x = c . 

• If fix) < 0 , then the graph of f (x) is concave down. 

• If fix) > 0 , then the graph of f (x) is concave up. 


Example 2: 


Let /(x) = x 3 + 2x + 7 . Find any points of inflection on the graph of /(x) . 


Filld S' 


f{x) = x 3 + 2 x + 7 


fix) =3 x 2 + 2 


0 = 0 - 


fix) = 6x 


6x = 0 


x = 0 
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Extreme Value Theorem 


• Determine whether f”(x) changes signs at x = 0 . 
Choose an x value that is smaller than 0: 


/"(— 1 ) = 6 (— 1 ) = — 6 < 0 
Choose an x value that is larger than 0: 

/"( l) = 6(1) = 6 > 0 

Therefore, there exists a point of inflection at x = 0 because /"( 0) = 0 and f"(x) changes signs at 
x = 0 . 


Answer: point of inflection: x = 0 . 


Extreme value theorem 1 


1 http://en.wikipedia.org/wiki/Extreme%20value%20theorem 
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26 Rolled Theorem 


Rolle’s Theorem 

If a function, fix) , is continuous on the closed interval [a, b] , is differentiable on the open interval 
ia,b) , and / ( a) = f{b) , then there exists at least one number c, in the interval (a, b) such that 
/'(c) - 0 . 


Rolle’s Theorem is important in proving the Mean Value Theorem 1 . 


26.1 Examples 



a c b 


Figure 36 

Example: 

fix) = x 2 -3x. Show that Rolle’s Theorem holds true somewhere within this function. To do so, 
evaluate the x-intercepts and use those points as your interval. 


1 Chapter 27 on page 177 
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Solution: 

1: The question wishes for us to use the x-intercepts as the endpoints of our interval. 

Factor the expression to obtain x(x- 3) = 0. x = 0 and x = 3 are our two endpoints. We know that 
J[ 0) and fi3) are the same, thus that satisfies the first paid of Rolle’s theorem (J(a) = j{b)). 

2: Now by Rolle’s Theorem, we know that somewhere between these points, the slope will be zero. 
Where? Easy: Take the derivative. 

dy_ 

dx 

= 2x — 3 

Thus, at x = 3/2, we have a spot with a slope of zero. We know that 3/2 (or 1.5) is between 0 and 3. 
Thus, Rolle’s Theorem is true for this (as it is for all cases). This was merely a demonstration. 
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27 Mean Value Theorem 


1. REDIRECT Calculus/Mean Value Theorem 1 

27.1 Exercises 

See the exercises for Differentiation. 


1 http://en.wikibooks.org/wiki/ Calculus %2FMean%20Value%20Theorem 
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28 Integration 


28.1 Integration 1 



28. 1 . 1 Basics of Integration 

• Definite integral 2 

• Indefinite integral 3 

• Notes on Integration Techniques 4 


1 Chapter 2 8 on page 179 

2 http://en.wikibooks.org/wiki/Calculus%2FDefinite%20integral 

3 http://en.wikibooks.org/wiki/ Calc ulus %2FIndef in i te_inte gral 

4 http://en.wikibooks.org/wiki /Calculus %2FTe chn i que s_of_Integrat i on 
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• Fundamental Theorem of Calculus 5 


28.1.2 Integration techniques 

• Infinite Sums 6 

• Derivative Rules and the Substitution Rule 7 

• Integration by Parts 8 

• Complexifying 9 

• Rational Functions by Partial Fraction Decomposition 10 

• Trigonometric Substitutions 11 

• Tangent Half Angle Substitution 12 

• Trigonometric Integrals 13 

• Reduction Formula 14 

• Irrational Functions 15 

• Numerical Approximations 16 

• Integration techniques 17 

• Improper integrals 18 

• Exercises 19 


5 http://en.wikibooks.org/wiki/Calculus%2FFundamental%20Theorem%20of%20Calculus 

6 Chapter 2 8 . 2 on page 182 

7 Chapter 28.4 on page 183 

8 Chapter 28.7 on page 187 

9 Chapter 2 8 . 9 on page 190 

10 Chapter 28.10 on page 191 

11 Chapter 28.1 1 on page 193 

12 http : / /en . wikibooks . org/wiki/Calculus%2FIntegration%20techniques%2FTangent% 
20Half%20Angle 

13 Chapter 28.12 on page 200 

14 Chapter 28.13 on page 204 

15 Chapter 28.14 on page 205 

16 Chapter 28.15 on page 207 

17 http://en.wikibooks.org/wiki /Calculus%2FIntegrat I ON_TECHN I QUE s 

18 Chapter 29 on page 209 

19 Chapter 29.1 on page 209 


180 



INTEGRATION' 


24 



t 




28.1.3 Applications of Integration 

• Area 20 

• Volume 21 

• Volume of solids of revolution 22 


20 Chapter 31 on page 213 

21 Chapter 32 on page 215 

22 Chapter 3 3 on page 221 
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• Arc length 23 

• Surface area 25 

• Work 26 

• Centre of mass 27 

• Pressure and force 28 

• Probability 29 


28.2 Infinite Sums 


The most basic, and arguably the most difficult, type of evaluation is to use the formal definition of 
a Riemann integral. 


28.3 Exact Integrals as Limits of Sums 

Using the definition of an integral, we can evaluate the limit as n goes to infinity. This technique 
requires a fairly high degree of familiarity with summation IDENTITIES 30 . This technique is often 
referred to as evaluation "by definition," and can be used to find definite integrals, as long as the 
integrands are fairly simple. We start with definition of the integral: 


I fa fix) dx 1= lim^oo ^pZ” =1 /(xp IThen picking x* to be x,- = a+i ^ we get, I- I 1= 
oo L” =1 f( a + i~, y L )- 1} In some simple cases, this expression can be reduced to a real 
number, which can be interpreted as the area under the curve if f(x) is positive on [a,b]. 


28.3.1 Example 1 

Find / 0 2 x 2 dx by writing the integral as a limit of Riemann sums. 


I Jo x 2 dx ^lim^oo ^pZ” =1 /(x*) I- I 1= lim^oo | L" =1 
lim I” =1 % !- 1 l= lim„_oo * 2 I- I 1= lim„^oo 

I- 1 1 = lim,,^ f + £ + §£? I- 1 1 = | 1 } 


n 2 i) I- I 1= lim^oof X" =1 (^) 2 1-11= 

^ n(w+1) 6 t2w+1) I- I Nlim^oo | ^±1 


In other cases, it is even possible to evaluate indefinite integrals using the formal definition. We can 
define the indefinite integral as follows: 


23 

25 

26 

27 

28 

29 

30 


Chapter 34 on page 223 
Chapter 35 on page 227 
Chapter 36 on page 229 
Chapter 37 on page 231 

http://en.wikibooks.org/wiki/Calculus%2FPressure%20and%20force 
http://en.wikibooks.org/wiki/Calculus%2FProbability 
http://en.wikipedia.org/wiki/Si gma_notat ion%2 3Ident I ties 
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Derivative Rules and the Substitution Rule 


I ffM dx 1= /o' /ft) dt = lirn„ ^ L". ,/!<,' 1 I- I 1= /(o+ I- I 1= 

28.3.2 Example 2 

Suppose fix ) = x 2 , then we can evaluate the indefinite integral as follows. 


iiiti/R?) i- ' 

Him ^ 


I / 0 /(£) df = Him^oo 
Him ^ 

I- I I- 1 lAlim„_„ (g + g + i) I 


llim n ^oo 
- I llim„^ 


L n ( xj_\ 3 

i = 1 l « J 


X 3 72(72+1) (2/1+1) 

OO „3 


't'} 


31 


llimn^oo 

- I llim„^ 


y n . 

^-;=l 


x 3 n(n+l)(2n+l) 

OO 


lx 3 Urn™ (5 + 271 + I- 1 l^ 3 • (5) I- 


28.4 Derivative Rules and the Substitution Rule 

After learning a simple list of antiderivatives, it is time to move on to more complex integrands, 
which are not at first readily integrable. In these first steps, we notice certain special case integrands 
which can be easily integrated in a few steps. 


28.5 Recognizing Derivatives and Reversing Derivative Rules 

If we recognize a function g(x) as being the derivative of a function fix), then we can easily express 
the antiderivative of g(x): 

/ g(x) dx = fix ) + C. 

For example, since 

J^sinx = cosx 

we can conclude that 

/ cosxdx = sinx+ C. 

Similarly, since we know e x is its own derivative, 

/ e x dx - e x + C. 

The power rule for derivatives can be reversed to give us a way to handle integrals of powers of x. 
Since 

d v n _ „ v n- 1 
dx x -nx 

we can conclude that 


31 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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f nx n 1 dx = x n + C, 
or, a little more usefully, 
f x n dx= ^ + C. 

28.6 Integration by Substitution 



WIKIVERSITY 


Figure 39: Integration by Substitution 32 


For many integrals, a substitution can be used to transform the integrand and make possible the 
finding of an antiderivative. There are a variety of such substitutions, each depending on the form 
of the integrand. 


28.6. 1 Integrating with the derivative present 

If a component of the integrand can be viewed as the derivative of another component of the inte- 
grand, a substitution can be made to simplify the integrand. 

For example, in the integral 


/ 


3x 2 (x 3 + l ) 5 dx 


we see that 3x 2 is the derivative of x 3 + 1. Letting 


u = x 3 + 1 


we have 



or, in order to apply it to the integral, 


32 http://en.wikiversity.org/wiki/Integration%20by%20Substitution 
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du = 3 x 2 dx 


With this we may write / 3x 2 (x 3 + l) 5 dx = f u 5 d u . = g u 6 + C = | (x 3 + l) 6 + C. 

Note that it was not necessary that we had exactly the derivative of u in our integrand. It would have 
been sufficient to have any constant multiple of the derivative. 

For instance, to treat the integral 


we may let u = x 5 . Then 


/ 


x 4 sin(x 5 ) dx 


and so 


du = 5 x 4 dx 


-du = x 4 dx 
5 


the right-hand side of which is a factor of our integrand. Thus, 


J x 4 sin(x 5 ) dx- J 


1 1 1 c 

- sin udu = --cosu + C = -- cos x +C. 
5 5 5 


In general, the integral of a power of a function times that function’s derivative may be integrated 
in this way. Since = g'(x), 

we have dx = ri ^ 4)1 . 


Therefore, / g'(x)[g(x)]" 


= /g'(x)[g(x)] w ^M 

= f[g{x)]"d[g{x)] 

_ [ g ( x )]” +1 
n + 1 


There is a similar rule for definite integrals, but we have to change the endpoints. 

Substitution rule for definite integrals Assume u is differentiable with continuous derivative and 
that /is continuous on the range of u. Suppose c = u{a ) and d = u{b). Then f(u{x))j^dx = 
f? f[u)du. 


28.6.2 Examples 

Consider the integral 
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/ xcos(x 2 + l )dx 
Jo 

By using the substitution u = x 2 + 1, we obtain du = 2x dx and 

{I 

I- I / 0 2 xcos(x 2 + 1) dx 1= \ / 0 2 cos(x 2 + 1)2 xdx I- 

Note how the lower limit x = 0 was transformed 
= 2 2 + 1 = 5. 

28.6.3 Proof of the substitution rule 

We will now prove the substitution rule for definite integrals. Let Fbe an anti derivative of /so 

F'(x) = /(x) 

. By the Fundamental Theorem of Calculus 
f? f(u)du = F(d)-F(c). 

Next we define a function G by the rule 


I 1= | /fcos {u)du I- 1 1= ^(sin(5) - sin(l)). 1} 
into u = 0 2 + 1 = 1 and the upper limit x = 2 into u 


G(x) = F{u{x)) . 

Then by the Chain rule G is differentiable with derivative 

G'(x) = F'(u(x)) u'(x) = /(u(x))w'(x) . 

Integrating both sides with respect to x and using the Fundamental Theorem of Calculus we get 

rb r-b 

/ f(u{x))u'{x)dx= / G\x)dx = G{b) - G{a). 

J a J a 

But by the definition of Fthis equals 

rd 

G{b)-G{a) = F{u{b))-F{u{a)) = F{d)-F{c) = J f(u)du. 

Hence 


rb rd 

/ f(u(x))u'{x)dx= / f{u)du. 
J a Jc 
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which is the substitution rule for definite integrals. 

33 


28.7 Integration by Parts 

Continuing on the path of reversing derivative rules in order to make them useful for integration, we 
reverse the product rule. 


28.8 Integration by Parts 

If y = u v where u and v arc functions of x, 
Then y' = ( uv)' = v'u+ u! v 
Rearranging, uv' = (uv)' - vu' 
Therefore,/ uv'dx = f {uv)' dx- f vu' dx 
Therefore, / uv'dx= uv- f vu'dx, or 


/ 


udv = uv- 


/ 


v du 


This is the integration by parts formula. It is very useful in many integrals involving products of 
functions, as well as others. 

For instance, to treat 


I 


xsin xdx 


we choose u = x and dv = sin xdx. With these choices, we have du = dx and v = -cosx, and we 
have 


J xsir 


/ 


xsinxdx = -xcosx- I (-cosx) dx - -xcosx+ I cosxdx = -xcosx + sinx + C. 


/' 


Note that the choice of u and dv was critical. Flad we chosen the reverse, so that u = sinx and 
dv = xdx, the result would have been 


1 2 • f 1 z , 

-x sinx- / -x cos xdx. 

2 J 2 


33 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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The resulting integral is no easier to work with than the original; we might say that this application 
of integration by parts took us in the wrong direction. 

So the choice is important. One general guideline to help us make that choice is, if possible, to 
choose u to be the factor of the integrand which becomes simpler when we differentiate it. In the 
last example, we see that sinx does not become simpler when we differentiate it: cos x is no simpler 
than sinx. 

An important feature of the integration by parts method is that we often need to apply it more than 
once. For instance, to integrate 


/ 


x 2 e x dx 


we start by choosing u = x 2 and dv = e x to get 


/ 


x 2 e x dx = x 2 e x -2 I xe x dx. 


’-f 


Note that we still have an integral to take care of, and we do this by applying integration by parts 
again, with u = x and dv = e x dx, which gives us 


/ 


x 2 e x dx = x 2 e x -2 


I 


xe ' 


dx = x 2 e x - 2{xe x -e x ) + C-x 2 e x - 2xe x + 2e x + C. 


So, two applications of integration by parts were necessary, owing to the power of x 2 in the inte- 
grand. 

Note that any power ofx does become simpler when we differentiate it, so when we see an integral 
of the form 


J x’’f(x)dx 

one of our first thoughts ought to be to consider using integration by parts with u = x”. Of course, 
in order for it to work, we need to be able to write down an antiderivative for d v. 


28.8.1 Example 

Use integration by parts to evaluate the integral 

J sin {x)e x dx 

Solution: If we let u = sin(x) and v' = e x , then we have u' = cos(x) and v = e x . Using our rule for 
integration by parts gives 
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/ 


sin(x)e x dx= e x sin(x) - / cos(x)e x dx 




We do not seem to have made much progress. But if we integrate by parts again with u = cos(x) 
and v' = e x and hence u! = - sin(x) and v - e x , we obtain 

J sin(x)e x dx= e x sin(x) - e x cos(x) - J e x sin(x ) dx 
We may solve this identity to find the anti-derivative of e x sin(x) and obtain 


/ 


sin(x)e x dx = -e x (sin(x) - cos(x)) + C 


28.8.2 With definite integral 

For definite integrals the rule is essentially the same, as long as we keep the endpoints. 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> Integration by 
parts for definite integrals Suppose /and g arc differentiable and their derivatives arc continuous. 
Then 


fb 

/ f{x)g'{x)dx = 

J a 


/(x)g(x) - f f(x)g(x)dx 

a Ja 


= f{b) g{b) - fia)gia) - I / ix)gix)dx 


r b 

/ /'( 

J a 


</blockquote> 

This can also be expressed in Leibniz notation. <blockquote style="background: white; border: lpx 
solid black; padding: lem;"> 




vdu. 


</blockquote> 
Integration by parts 34 


34 http ://en. wikipedia.org/wiki/Integration%20by%20parts 
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28.9 Integration by Complexifying 


This technique requires an understanding and recognition of complex numbers. Specifically Euler’s 
formula: 

cos0 + i ■ sin0 = e 1 ' 9 

Recognize, for example, that the real portion: 

Re{e' 0 } = cos0 


Given an integral of the general form: 
f e x cos2x dx 
We can complexify it: 

/Re{e x (cos2x + i -sin2x)} dx 
fRe{e x {e i2x )} dx 
With basic rules of exponents: 
fRe{e x+l2x } dx 

It can be proven that the "real portion" operator can be moved outside the integral: 
Re{/ e x(1 +2d 


The integral easily evaluates: 


Multiplying and dividing by (l-2i): 
Re{^-e x(1+2i) } 

Which can be rewritten as: 

Re { k^i e x e i2x } 

Applying Euler’s forumula: 

Re{^ 1 e x (cos2x+ Gsin2x)} 

Expanding: 

Re{^-(cos2x + 2sin2x) + i ■ ^-(sin2x-2cos2x)} 
Taking the Real paid of this expression: 

^-(cos2x + 2sin2x) 

So: 


/ e x cos2 xdx= ^-(cos2x + 2sin2x) + C 
35 


35 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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28.10 Rational Functions by Partial Fractional Decomposition 

Suppose we want to find f ^%^-dx. One way to do this is to simplify the integrand by finding 
constants A and B so that 


3x + 1 3x +1 A B 
x 2 + x x(x+l) X X + 1 ' 

This can be done by cross multiplying the fraction which gives • As both sides 

have the same denominator we must have 3x + 1 = A(x+ 1) + Bx. This is an equation for x so must 
hold whatever value x is. If we put in x = 0 we get 1 - A and putting x = -1 gives -2 = -B so 
B = 2. So we see that 


3x+ 1 1 2 

— 7 , — 1 

X z + X X X + 1 


Returning to the original integral 


I f^ dx 1= \f d -?+fikdx I- I 1= lln|x| +2ln|x+ 1| + C I- 1 1} 

Rewriting the integrand as a sum of simpler fractions has allowed us to reduce the initial integral to 
a sum of simpler integrals. In fact this method works to integrate any rational function. 

Method of Partial Fractions: 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> 

• Step 1 Use long division to ensure that the degree of P(x) less than the degree of Q(x). 

• Step 2 Factor Q(x) as far as possible. 

• Step 3 Write down the correct form for the partial fraction decomposition (see below) and solve 
for the constants. 

</blockquote> 

To factor Q(x) we have to write it as a product of linear factors (of the form ax + b ) and irreducible 
quadratic factors (of the form ax 2 + bx + c with b 2 - 4 ac < 0). 

Some of the factors could be repeated. For instance if Q(x) = x 3 - 6x 2 + 9x we factor Q(x) as 

Q(x) = x(x 2 - 6x + 9) = x(x - 3) (x - 3) = x(x - 3) 2 . 

It is important that in each quadratic factor we have b 2 - Aac < 0, otherwise it is possible to factor 
that quadratic piece further. For example if Q(x) - x 3 - 3x 2 - 2x then we can write 

Q(x) = x(x 2 -3x + 2) = x(x — 1) (x + 2) 

We will now show how to write P(x)IQ(x) as a sum of terms of the form 
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A 

( ax+b) k 


(. ax 2 + bx + c) k 

Exactly how to do this depends on the factorization of Q(x) and we now give four cases that can 
occur. 

<blockquote sty!e="background: white; border: lpx solid black; padding: lenr;"> Case (a) Q(x) is 
a product of linear factors with no repeats. </blockquote> 

This means that Q(x) = (iq x+ bi)[a 2 x + b 2 )...{a n x + b n ) where no factor is repeated and no factor 
is a multiple of another. 

For each linear term we write down something of the form (a ^ +hj , so in total we write 


PM | A 2 | | A n 

QM (flix+hi) [a 2 x + b 2 ) ( a n x + b „ ) 


28.10.1 Example 1 

Find f 

J (x+3) (x+5) (x+7) 

Here we have PM = 1 + x 2 , QM = (x + 3) (x + 5) (x + 7) and Q(x) is a product of linear factors. So 
we write 

l+x 2 _ A , B , C 

(x+3)(x+5)(x+7) — x+3 x+5 x+7 

Multiply both sides by the denominator 

1 + x^ = A(x + 5)(x + 7) + 5(x + 3)(x + 7) + C(x + 3)(x + 5) 

Substitute in three values of xto get three equations for the unknown constants, 

x = -3 1 + 3 2 = 2 • 4 A 

x = -5 1 + 5 2 = -2-2B 

x = -7 1 + 7 2 = (—4) • ( — 2) C 

so A = 5/4,5 = -13/2, C = 25/4, and 

l+x 2 _ 5 13 , 25 

(x+3) (x+5) (x+7) 4x+12 2x+10 4x+28 

We can now integrate the left hand side. 

f (x+3)(x+5)(x+7) = 4 I + 3 1 2 ” I + 5 1 + In | x + 7 1 + C 

<blockquote style="background: white; border: lpx solid black; 
padding: lem;"> 


Case (b) Q(x) is a product of linear factors some of which are repeated. </blockquote> 

If [ax + b) appeal's in the factorisation of Q(x) k-times. Then instead of writing the piece (a Y +hj we 
use the more complicated expression 


192 




Trigonometric Substitutions 


Ai i A 2 A3 . Ak 

ax+b ( ax+b ) 2 _r ( ax+b ) 3 (ax+b) k 


28.10.2 Example 2 

Find / ( * + mx + 2Y dx 

Here P(x)-1" and "Q(x)=(x+l)(x+2) 2 We write 

1 _A_ , _B_ , C 

(x+l)(x+2) 2 x+1 x+2 ^ (x+2) 2 

Multiply both sides by the denominator 1 = A{x + 2) 2 + B{x + l)(x + 2) + C(x + 1) 

Substitute in three values of x to get 3 equations for the unknown constants, 

x = 0 1 = 2 2 A + 2B + C 

x = — 1 1 = A 

x = -2 1 = -C 

so A=l, B=-l , C=-l, and 
1 _ j _ 1 1 

(x+1) (x+2) 2 x+1 x+2 (x+2) 2 

We can now integrate the left hand side. / (X+1) [ A . +Z)2 dx = In + C 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> Case (c) Q(x) 
contains some quadratic pieces which are not repeated. </blockquote> 

If (ax 2 + bx+ c) appears we use (a J^+ c) ■ 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> Case (d) Q(x) 
contains some repeated quadratic factors. </blockquote> 

If (ax 2 + bx + c) appears k-times then use 


A\X + B\ 


A 2 x + B 2 


+ ■ 


A3X + B% 


{ax 2 + bx + c) {ax 2 + bx + c) 2 {ax 2 + bx+c) 3 


■ H + 


AfcX + Bfc 

{ax 2 + bx + c) k 


36 


28.11 Trigonometric Substitutions 


The idea behind the trigonometric substitution is quite simple: to replace expressions involving 
square roots with expressions that involve standard trigonometric functions, but no square roots. 
Integrals involving trigonometric functions arc often easier to solve then integrals involving square 
roots. 

Let us demonstrate this idea in practice. Consider the expression Vl-x 2 . Probably the most basic 
trigonometric identity is sin 2 (0) + cos 2 (0) = 1 for an arbitrary angle 6. If we replace x in this 
expression by sin(d), with the help of this trigonometric identity we see 


36 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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Vl-x 2 = \J l-sin 2 (0) = \/ ~ cos 2 (0) = cos(0) 

Note that we could write 6 = arcsin(x) since we replaced x 2 with sin 2 (0). 

We would like to mention that technically one should write the absolute value of cos(0), in other 
words | cos(0)| as our final answer since = \A\ for all possible A. But as long as we are careful 
about the domain of all possible x and how cos(O) is used in the final computation, omitting the 
absolute value signs does not constitute a problem. However, we cannot directly interchange the 
simple expression cos(O) with the complicated V 1 - x 2 wherever it may appeal - , we must remember 
when integrating by substation we need to take the derivative into account. That is we need to 
remember that dx = cos (0) dd, and to get a integral that only involves 6 we need to also replace dx 
by something in terms of dO. Thus, if we see an integral of the form 



we can rewrite it as 


/ 


cos(6)cos6 d6 = I cos 2 Odd. 


/' 


Notice in the expression on the left that the first cos 6 comes from replacing the V l - x 2 and the 
cos 6 dd comes from substituting for the dx. 

Since cns 2 (0) = | (1 + cos(20)) our original integral reduces to: 


ifdd+lf 


cos(2 9)dd 


These last two integrals are easily handled. For the first integral we get 


if 


2 . d6 = ^ 6 


For the second integral we do a substitution, namely u = 26 (and du = 2 dd) to get: 


| j cos(2 9)d6 = ^ J cos u^du= |sinw = |sin(20) 


Finally we see that: 


J cos 2 6 d6 = + \ sin(20) = \Q + ^ sin(0) cos(9) 

However, this is in terms of 6 and not in terms of x, so we must substitute back in order to rewrite 
the answer in terms of x. 

That is we worked out that: 
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sin(0) = x cos(0) = V 1 - x 2 and 6 = arcsin(x) 
So we arrive at our final answer 


J \/l- x 2 dx = | arcsin(x) + | xV 1 - x 2 

As you can see, even for a fairly harmless looking integral this technique can involve quite a lot of 
calculation. Often it is helpful to see if a simpler method will suffice before turning to trigonometric 
substitution. On the other hand, frequently in the case of integrands involving square roots, this is 
the most tractable way to solve the problem. We begin with giving some rules of thumb to help you 
decide which trigonometric substitutions might be helpful. 

If the integrand contains a single factor of one of the forms V a 2 - x 2 or V a 2 + x 2 or V x 2 - a 2 we 
can try a trigonometric substitution. 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> 

• If the integrand contains V a 2 - x 2 let x = asinO and use the IDENTITY 37 1 - sin 2 6 = cos 2 6. 

• If the integrand contains V a 2 + x 2 let x = a tanfl and use the identity 1 + tan 2 6 = sec 2 0. 

• If the integrand contains V x 2 - a 2 let x = a sec 6 and use the identity sec 2 9 - 1 = tan 2 0. 

</blockquote> 



WIKIVERSITY 


Figure 40: TRIGONOMETRIC SUBSTITUTIONS 38 


. . /tmp/41 .png 


Figure 41: TRIGONOMETRIC SUBSTITUTIONS 39 


37 Chapter 64 on page 417 

38 http://en.wikiversity.org/wiki/Trigonometric%20Substitutions 

39 http ://en. wikipedia.org/wiki/Trigonometric%20substitution 
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28. 1 1 . 1 Sine substitution 



Figure 42: This substitution is easily derived from a triangle, using the PYTHAGOREAN 
Theorem 40 . 


If the integrand contains a piece of the form V a 2 - x 2 we use the substitution 


x = asin0 dx = acosOdd 

This will transform the integrand to a trigonometric function. If the new integrand can’t be inte- 
grated on sight then the tan-half-angle substitution described below will generally transform it into 
a more tractable algebraic integrand. 

E.g., if the integrand is &radic;( I -x 2 ), 

/o VT - x 2 dx = Jq IZ V 1 - sin 2 6 cos Odd 
= Jq 12 cos 2 6 d6 

= l/o^l + cos 26 dB 

n 

4 

If the integrand is &radic;(l+x)/&radic;(l-x), we can rewrite it as 


1 + x 
1 - x 


1 + x 1 + x 
1 + x 1 — X 


1 + X 

Vl -x 2 


Then we can make the substitution 


40 http://en.wikipedia.org/wiki /P ythagore an_Theorem 
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ra 1+x 
Jo n/Wx 5 


dx 


fo 1 cosg 0 cosOdB 0<n<l 
f^l + sinddd a = sin -1 a 
a + [-cos0]q 
a + l-cosa 
1 + sin -1 a - v 7 1 - a 2 


28. 1 1 .2 Tangent substitution 



Figure 43: This substitution is easily derived from a triangle, using the PYTHAGOREAN 
Theorem 41 . 


When the integrand contains a piece of the form V a 2 + x 2 we use the substitution 

x = atand \J x 2 + a 2 = a sec 9 dx=asec 2 dd0 
E.g., if the integrand is (jr+a 2 )" 3/2 then on making this substitution we find 


/ 0 Z (x 2 + a 2 ) 2 dx 


a ~ 2 fo cosddO 
a~ 2 [sin0]o 
cC 2 sin a 

2 zla 
\/l +z 2 la 2 


Z > 0 

a = tan _1 (z/a) 
— J_ z 

fl2 \/a 2 +z 2 


If the integral is 


1= f V x 2 + a 2 
Jo 


z> 0 


41 http://en.wikipedia.org/wiki /P ythagore an_Theorem 
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then on making this substitution we find 


I 


a 2 / 0 “ sec 3 d dd 
a 2 / 0 “ sec d dtand 
a 2 [ sec0tan0]g 
a 2 sec a tan a 
a 2 sec a tan a 


a = tan 1 {zl a) 


a 2 Jq seed tan 2 6 dd 
a 2 Jq sec 3 d dd 
I 


+a 2 Jq seed dd 
+a 2 fa seed dd 


After integrating by parts, and using trigonometric identities, we’ve ended up with an expression 
involving the original integral. In cases like this we must now rearrange the equation so that the 
original integral is on one side only 


I = ^fi 2 sec a tan a + 

= ^ a 2 sec a tan a + 

= Tjd 2 sec a tan a + 



= ^zVz 2 + a 2 + 


^a 2 Jq seed dd 
\ a 2 [In (sec d + tan 6) ] q 
| a 2 In (sec a + tan a) 


As we would expect from the integrand, this is approximately z 2 /2 for large z. 

In some cases it is possible to do trigonometric substitution in cases when there is no y A — appealing 
in the integral. 

Example 


f — ^ — dx 

J x 2 + 1 

The denominator of this function is equal to (\/l + x 2 ) 2 . This suggests that we try to substitute x = 
tan u and use the identity 1 + tan 2 u = sec 2 u . With this substitution, we obtain that dx = sec 2 u d u 
and thus 


/ — =- — dx= f — ^ sec 2 udu 

x z + l J tan 2 u + 1 

-[A 

J sec 2 


-sec udu 


I 


= du 


= u + c 


Using the initial substitution u = arctanx gives 
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/ 


1 

x 2 + 1 


dx = arctanx + C 


28. 1 1 .3 Secant substitution 




Figure 44: This substitution is easily derived from a triangle, using the PYTHAGOREAN 
Theorem 42 . 


If the integrand contains a factor of the form V x 2 - a 2 we use the substitution 


x = asecd dx = asecdtanOdO V x 2 - a 2 = a Land. 


Example 1 


Find 


A z 


f; 


\/ x 2 - 1 


dx. 


— dx = 


fi secdtanddO 


ra tand . 

Sci 

/ 0 “ tan 2 d d6 
[tan0 -0 ]q 
tan a - a 


\J z 2 - 1 - 


sec 


-l 


z> 1 

a = sec -1 z 
tan a = v 7 sec 2 a - 1 
tan a = \/z 2 -l 


Example 2 

V x 2 - 1 


Find 


r 


rfx. 


42 http://en.wikipedia.org/wiki /P ythagore an_Theorem 
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= f! 


oc tan 6 


sec 0 tan 6 d6 


sec 2 0 

f 0 as -^dO 


JO cos 6 

We can now integrate by parts 

fi dx = - [tan 0 cos 0]^ +/ 0 “ sect? d0 

= -sina + [In(sec0 + tan0)]g 
= In (sec a + tan a) - sin a 

= ln(z+\/z2^T)-^P 


z> 1 

a = sec -1 z 


43 


28.12 Trigonometric Integrals 


When the integrand is primarily or exclusively based on trigonometric functions, the following 
techniques are useful. 


28. 1 2. 1 Powers of Sine and Cosine 

We will give a general method to solve generally integrands of the form cos"' (x)sin"(x). First let 
us work through an example. 


J (cos 3 x) (sin 2 x) dx 

Notice that the integrand contains an odd power of cos. So rewrite it as 


/ 


(cos 2 x) (sin 2 x) cos x dx 


We can solve this by making the substitution u = sin(x) so d u = cos(x) dx. Then we can write the 
whole integrand in terms of u by using the identity 

cos(x) 2 = 1 - sin 2 (x)=l-u 2 . 

So 


/ (cos 3 x) (sin 2 x) dx 


f (cos 2 x) (sin 2 x) cos x dx 
/(I - u 2 )u 2 du 
f u 2 du- f u 4 du 
^ u 3 + ^ / r' + C 
| sin 3 x - | sin 5 x + C 


43 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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This method works whenever there is an odd power of sine or cosine. 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> To evaluate 
/ (cos'" x) (sin" x) dx when either m or n is odd. 

• If m is odd substitute u=sin x and use the identity cos 2 x = 1 - sin 2 x^l-u 2 . 

• If n is odd substitute z<= cos x and use the identity sin 2 x = 1 - cos 2 x=l-u 2 . 

</blockquote> 

Example 

Find 12 cos 40 (x) sin 3 (x)dx. 

As there is an odd power of sin we let u = cosx so du = - sin(x)dx. Notice that when x=0 we have 
u=cos(0)-l and when x = nl 2 we have u = cos(7r/2) = 0. 

Jq 12 cos 40 (x) sin 3 (x)dx = /J r/2 cos 40 (x) sin 2 (x) sin(x)dx 

= -jf n 40 (l - u 2 )du 

= Jq u 40 [l- u 2 )du 

= J^u i0 -u i2 du 

f _L 7/41 L 7/43l 1 

- Ul w 4j5 U J 0 

“ TT “ 43‘ 

When both m and n are even things get a little more complicated. 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> To evaluate 
/ (cos'" x) (sin" x) dx when both m and n are even. 

Use the IDENTITIES 44 sin 2 x= 1/2 (1- cos 2x) and cos 2 x= 1/2 (1+ cos 2x). </blockquote> 


Example 

Find / sin 2 xcos 4 xdx. 

As sin 2 x= 1/2 (1- cos 2x) and cos 2 x= 1/2 (1+ cos 2x) we have 


J sin 2 xcos 4 xdx = y|^(l-cos2x) 


,-(l + cos2x) dx, 
(2 1 


and expanding, the integrand becomes 


1 

8 



-cos 2 2x + cos2x-cos 3 2x) dx. 


Using the multiple angle identities 


44 Chapter 64 on page 417 
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I = g (/ldx-/cos 2 2xdx + /cos2xdx-/cos 3 2xdx) 

= g (x- \f (1 + cos4x) dx + ^sin2 x- f cos 2 2xcos2 xdx) 
= yg (x + sin2x + / cos4xdx-2 f (1 -sin 2 2x)cos2xdx) 

then we obtain on evaluating 


x sin4x sin 3 2x 

/ — 1 h C 

16 64 48 

28.12.2 Powers of Tan and Secant 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> To evaluate 
/ (tan'” x) (sec” x) dx. 

1. If n is even and n > 2 then substitute z/= tan x and use the IDENTITY 45 sec 2 x = 1 + tan 2 x. 

2. If n and m are both odd then substitute zz=sec x and use the IDENTITY 46 tan 2 x = sec 2 x-l. 

3. If n is odd and m is even then use the IDENTITY 47 tan 2 x = sec 2 x-l and apply a reduction 
formula to integrate sec 1 xdx. 

</blockquote> 


Example 1 

Find / sec 2 xdx. 

There is an even power of secx. Substituting u = tanx gives du = sec 2 xdx so 
/ sec 2 xdx = f du= u + C = tan x+C. 


Example 2 


Find / Ian xdx. 


Let u = cosx so du = - sin xdx. Then 


/ tan xdx 


r^ x dx 

firdu 


— In | w | + C 
-In | cosx| + C 
ln|secx| + C. 


45 Chapter 64 on page 417 

46 Chapter 64 on page 417 

47 Chapter 64 on page 417 
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Example 3 


Find / sec xdx. 

The trick to do this is to multiply and divide by the same thing like this: 
f sec xdx = /secxg§f±f§lfdx 

r sec 2 x+secxtanx • 

J secx+tanx 

Making the substitution u = secx + tanx so du = sec x tan x + sec 2 xdx, 

f sec xdx - f u^ u 

= In \u\ + C 

= ln|secx + tanx| + C 


28.12.3 More trigonometric combinations 


<blockquote style="background: white; border: lpx solid black; padding: lem;"> For the integrals 
/ sinnxcosmxdx or / sinnxsinmxrfx or / cosnxcosmxdx use the IDENTITIES 48 


sin a cosb = ^(sin(a + b) + sin (a - b)) 
sin a sin b= ^(cos(a - b) - cos(a + b)) 
cos a cos b = A (cos [a-b) + cos (a + b)) 


</blockquote> 


Example 1 

Find / sin3xcos5xdx. 

We can use the fact that sin a cos F=(l/2)(sin(fl+h)+sin(a-h)), so 

sin3xcos5x= (sin8x + sin(-2x))/2 
Now use the oddness property of sin(x) to simplify 

sin3xcos5x= (sin8x- sin2x)/2 

And now we can integrate 

/sin3xcos5xdx = ^ /sin8x- sin2xdx 

= |(-^cos8x+ ^cos2x) + C 


48 Chapter 64 on page 417 
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Example 2 

Find: / sin x sin 2 xdx. 
Using the identities 


1 1 

sinxsin2x = - (cos(-x) -cos(3x)) = -(cosx-cos3x). 


Then 


/ sinxsin2xdx 


| / (cosx-cos3 x)dx 
i(sinx- |sin3x) + C 


49 


28.13 Reduction Formula 

A reduction formula is one that enables us to solve an integral problem by reducing it to a problem 
of solving an easier integral problem, and then reducing that to the problem of solving an easier 
problem, and so on. 

For example, if we let 


-/ 


I n - / x n e x dx 


Integration by parts allows us to simplify this to 




I n = x n e x - n I x" l e x dx = 


I„ = x n e x -nl n -i 

which is our desired reduction formula. Note that we stop at 


Io = 


e 


X 


Similarly, if we let 


In = 


f 


sec n 9d0 


49 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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then integration by parts lets us simplify this to 


I n = sec" 2 a tan a -{n- 2) / sec" z 0tan z 0r?0 


/ 


-*7? 2 n / 


Using the trigonometric identity, tan 2 =sec 2 - 1 , we can now write 


I n = sec" 2 a tan a +{n-2) (/ 0 “sec" 2 6 dd - sec" 9 d6) 
- sec" -2 a tan a +{n-2) {I n -2 - I n ) 


Rearranging, we get 


1 „-2 n-2 

I n = sec a tan a H J „_2 

n - 1 n - 1 

Note that we stop at n= 1 or 2 if n is odd or even respectively. 

As in these two examples, integrating by parts when the integrand contains a power often results in 
a reduction formula. 

50 


28.14 Irrational Functions 


Integration of irrational functions is more difficult than rational functions, and many cannot be 
done. However, there are some particular types that can be reduced to rational forms by suitable 
substitutions. 


28.14.1 Type 1 


Integrand contains 


ax+b 

cx+d 


Use the substitution u = \J ffpf . 

Example 


™ fy^dx. 


I 


V ax.+b 


dx 


50 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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28.14.2 Type 2 

Integral is of the form / ^ i==|L= dx 

Write Px + Q as Px + Q = pjj^(ax 2 + bx + c) + q. 

Example 

Find f -dx. 

J V5-4x-.\- 

28.14.3 Type 3 

Integrand contains Vet 2 - x 2 , va 2 + x 2 or Vx 2 - a 2 

This was discussed in "trigonometric substitutions above". Here is a summary: 

1. For V a 2 - x 2 , use x = asinQ. 

2. For V a 2 + x 2 , use x = a tan0. 

3. For V x 2 - a 2 , use x = asecO. 



51 http://en.wikibooks.org/wiki /Cate gory%3ACalculus%2 0%2 8book%2 9 
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Numerical Approximations 


28.15 Numerical Approximations 


It is often the case, when evaluating definite integrals, that an antiderivative for the integrand cannot 
be found, or is extremely difficult to find. In some instances, a numerical approximation to the value 
of the definite value will suffice. The following techniques can be used, and are listed in rough order 
of ascending complexity. 


28.16 RiemannSum 

This comes from the definition of an integral. If we pick n to be finite, then we have: 
fa fix) dx*Y,? =1 f(x*)\x 

where x* is any point in the i-th sub-interval [x,-_i,x,-] on [a,b]. 

28.16.1 Right Rectangle 

A special case of the Rienrann sum, where we let x* = x,-, in other words the point on the far 
right-side of each sub-interval on, [a,b]. Again if we pick n to be finite, then we have: 

fa fix) dx = If =1 /(x,;)Ax 

28.16.2 Left Rectangle 

Another special case of the Rienrann sum, this time we let x* = x;_ i, which is the point on the far 
left side of each sub-interval on [a,b]. As always, this is an approximation when n is finite. Thus, 
we have: 

fa fix) dx = L" =1 /(X;_ i)Ax 

28.17 Trapezoidal Rule 

fa fix) dx* [/(x 0 )+2i;. i - 1 1 (/(X/))+ fix n )] = (/(x 0 ) + 2/(Xi) + 2/(x 2 ) + • • • + 2fix n -l) + 

fix,,)) 

28.18 Simpson’s Rule 

Remember, n must be even. 


I- I fa fix) dx 1= [/(xo) + Z” =1 1 {(3-(- 1 )')/(x l )) + /(x„)] I- I 1= 

^ [/(x 0 ) + 4/(xi) + 2 fix 2 ) + 4 fixf) + • • • + 4/(x„_i) + f (X/ 7 ) ] I } 
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28.19 Further reading 

• Numerical Methods/Numerical Integration 52 

53 


52 http://en.wikibooks.org/wiki /Numeri cal%20Method s%2FNumeri cal%2 OIntegrat ion 

53 http://en.wikibooks.org/wiki /Cate gory%3ACalculus%2 0%2 8book%2 9 
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29 Improper Integrals 


1. REDIRECT Calculus/Improper Integrals 1 


29.1 Exercises 

29.2 Set One: Sums 


[Insert Numbered Problems Here] 
Solutions to Set One 2 


29.3 Set Two: Integration of Polynomials 

Given the above rules, practice indefinite integration on the following: 

1. f{x 2 -2) 2 dx 

2. f 8x 3 dx 

3. / 4x 2 + 11 x 3 dx 

4. f 31x 32 + 4x 3 -9x 4 dx 

5. f 5 x~ 2 dx 

Solutions to Set Two 3 


29.4 Indefinite Integration 


Antiderivatives 

1. / cos x + sin xdx 

2. / 3 sin xdx 

3. / 1 + tan 2 xdx 

4. f 3x-sec 2 xdx 

5. f-e x dx 

6. f 8 e x dx 

I T^ dx 


1 http://en.wikibooks.org/wiki/Calculus%2FImproper%20Integrals 

2 Chapter 62. 1 1.1 on page 413 

3 Chapter 62.13 on page 413 
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8 - / 


1 

x 2 +a 2 


dx 


29.5 Integration by parts 

1. Consider the integral / sin(x) cos(x) dx. Find the integral in two different ways, (a) Integrate 
by parts with u = sin(x) and v' = cos(x). (b) Integrate by parts with u = cos(x) and v' = 
sin(x). 

Compare your answers. Are they the same? 

Solutions to Set Three 4 


4 Chapter 62.14 on page 413 
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31 Area 


31.1 Introduction 

Finding the area between two curves, usually given by two explicit functions, is often useful in 
calculus. 

In general the rule for finding the area between two curves is 
A = A t0 p ~ Abottom or 

If fix) is the upper function and g(x) is the lower function 
A = /a lf{x)-g[x)]dx 

This is true whether the functions are in the first quadrant or not. 


31.2 Area between two curves 

Suppose we arc given two functions yi=f(x) and y2=g(x) and we want to find the area between them 
on the interval [a,b]. Also assume that f(x)> g(x) for all xon the interval [a,b]. Begin by partitioning 
the interval [a,b] into n equal subintervals each having a length of A x=(b-a)/n. Next choose any 
point in each subinterval, Xj*. Now we can ’create’ rectangles on each interval. At the point x,-*, 
the height of each rectangle is f(Xi*)-g(Xj*) and the width is Ax Thus the area of each rectangle is 
[f(Xj *) -g(Xi *) ]Ax. An approximation of the area, A, between the two curves is 


A:= £[/(x-)-g(x-)]Ax 
j=i 

Now we take the limit as n approaches infinity and get 


A = 


n 


hrn^ g{x*)]Ax 

;=i 


which gives the exact area. Recalling the definition of the definite integral we notice that 


A = f [/(x) - g(x)] dx 
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This formula of finding the area between two curves is sometimes known as applying integration 
with respect to the x-axis since the rectangles used to approximate the area have their bases lying 
parallel to the x-axis. It will be most useful when the two functions arc of the form yi=f(x) and 
y2-g(x). Sometimes however, one may find it simpler to integrate with respect to the y-axis. This 
occurs when integrating with respect to the x-axis would result in more than one integral to be 
evaluated. These functions take the form Xi-f(y) and X2-g(y) on the interval [c,d]. Note that [c,d] 
arc values of y. The derivation of this case is completely identical. Similar to before, we will assume 
that f(y)> g(y) for all y on [c,d]. Now, as before we can divide the interval into n subintervals and 
create rectangles to approximate the area between f(y) and gfy). It may be useful to picture each 
rectangle having their ’width’, Ay, parallel to the y-axis and ’height’, f(yi*)-g(yi*) at the point y,-* 
parallel to the x-axis. Following from the work above we may reason that an approximation of the 
area. A, between the two curves is 

a := L[/0'f)-g(yi)] A y 

i=l 

As before, we take the limit as n approaches infinity to arrive at 

a = Hm X [/O'* ) - giy* )1 ■ Ay 

i=l 

which is nothing more than a definite integral, so 

a = J [/O') - g(y)i d y 

Regardless of the form of the functions, we basically use the same formula. 
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32 Volume 


When we think about volume from an intuitive point of view, we typically think of it as the amount 
of "space" an item occupies. Unfortunately assigning a number that measures this amount of space 
can prove difficult for all but the simplest geometric shapes. Calculus provides a new tool for 
calculating volume that can greatly extend our ability to calculate volume. In order to understand 
the ideas involved it helps to think about the volume of a cylinder. The volume of a cylinder is 
calculated using the formula V = nr 2 h. The base of the cylinder is a circle whose area is given 
by A = nr 2 . Notice that the volume of a cylinder is derived by taking the area of its base and 
multiplying by the height h. For more complicated shapes, we could think of approximating the 
volume by taking the area of some cross section at some height x and multiplying by some small 
change in height Ax then adding up the heights of all of these approximations from the bottom to the 
top of the object. This would appear to be a Riemann sum. Keeping this in mind, we can develop a 
more general formula for the volume of solids in (R 3 (3 dimensional space). 


32.1 Formal Definition 

Formally the ideas above suggest that we can calculate the volume of a solid by calculating the 
integral of the cross-sectional area along some dimension. In the above example of a cylinder, the 
every cross section was given by the same circle, so the cross-sectional area is therefore a constant 
function, and the dimension of integration was vertical (although it could have been any one we 
desired). Generally, if S is a solid that lies in IR 3 between x = a and x = b, let A(x) denote the area 
of a cross section taken in the plane perpendicular to the x direction, and passing through the point 
x. If the function A(x) is continuous on [a, b], then the volume Vs of the solid S is given by: 

Vs= f A(x) dx. 

J a 
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32.2 Examples 

32.2.1 Example 1: A right cylinder 


Figure 45 



Now we will calculate the volume of a right cylinder using our new ideas about how to calculate 
volume. Since we already know the formula for the volume of a cylinder this will give us a "sanity 
check" that our formulas make sense. First, we choose a dimension along which to integrate. In 
this case, it will greatly simplify the calculations to integrate along the height of the cylinder, so 
this is the direction we will choose. Thus we will call the vertical direction (see diagram) x. Now 
we find the function, A(x), which will describe the cross-sectional area of our cylinder at a height 
of x. The cross-sectional area of a cylinder is simply a circle. Now simply recall that the area of a 
circle is nr 2 , and so A{x) = nr 2 . Before performing the computation, we must choose our bounds 
of integration. In this case, we simply define x = 0 to be the base of the cylinder, and so we will 
integrate from x = 0 to x = h, where h is the height of the cylinder. Finally, we integrate: 


^cylinder 1= fa A(x) dx I- 1 1= f{f nr 2 dx I- 1 1= nr 2 Jq dx I- 1 1= nr 2 x\ h x=(j I- 1 1= nr 2 {h-0 ) I- 1 1= nr 2 h. 
I } This is exactly the familiar formula for the volume of a cylinder. 
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32.2.2 Example 2: A right circular cone 



Figure 46 


For our next example we will look at an example where the cross sectional area is not constant. 
Consider a right circular cone. Once again the cross sections are simply circles. But now the radius 
varies from the base of the cone to the tip. Once again we choose x to be the vertical direction, with 
the base at x = 0 and the tip at x = h, and we will let R denote the radius of the base. While we 
know the cross sections are just circles we cannot calculate the area of the cross sections unless we 
find some way to determine the radius of the circle at height x. 



V 

R 


Figure 47 
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Luckily in this case it is possible to use some of what we know from geometry. We can imagine 
cutting the cone perpendicular to the base through some diameter of the circle all the way to the tip 
of the cone. If we then look at the flat side we just created, we will see simply a triangle, whose 
geometry we understand well. The right triangle from the tip the base at height x is similar to the 
right triangle with from the tip with height h triangle. This tells us that = j r So that we see that 
the radius of the circle at height x is r(x) = j{h-x). Now using the familiar formula for the area of 
a circle we see that A{x) = 7 . 

Now we are ready to integrate. 


I Leone 1= fa A (x)dx I- 1 1= /o' (h - x) 2 dx I- 1 1= 7r|^ 
let u - h - x, then du = -dx and our integral becomes 


fo\h- x) 2 dx I- 


IBy u-substitution we may 
h\ 


u du \ I- 1 1= n 


I- I 1= 7r|r(- 0+ I- 1 1= ^nR 2 h. 1} 


32.2.3 Example 3: A sphere 



Figure 48 


In a similar fashion, we can use our definition to prove the well known formula for the volume of a 
sphere. First, we must find our cross-sectional area function, A{x). Consider a sphere of radius R 
which is centered at the origin in R 3 . If we again integrate vertically then x will vary from -R to R. 
In order to find the area of a particular cross section it helps to draw a right triangle whose between 
the center of the sphere, the center of the circular cross section, and a point along the circumference 
of the cross section. As shown in the diagram the side lengths of this triangle will be R, |x|, and r. 
Where r is the radius of the circular cross section. Then by the Pythagorean theorem r = \/ R 2 -\x\ 2 
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and find that A(x) = n{R 2 - \x\ 2 ). It is slightly helpful to notice that \x\ 2 = x 2 so we do not need to 
keep the absolute value. 

So we have that 


^sphere 1= f„Mx) dx I- I 1= f R R 7l(R 2 - X 2 ) dx I- 



nR 2 {R-i-R))-n^~ I- 


1 = 7X f R R R 2 dx-n f R R x 2 dx 
1 = 2 tiR 3 - 1 7tR 3 = | nR 3 . 1 } 


nR 2 x 


R 

x=-R 


32.3 Extension to Non-trivial Solids 

Now that we have shown our definition agrees with our prior knowledge, we will see how it can 
help us extend our horizons to solids whose volumes are not possible to calculate using elementary 
geometry. 

l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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33 Volume of solids of revolution 


33.1 Revolution solids 

A solid is said to be of revolution when it is formed by rotating a given curve against an axis. 
For example, rotating a circle positioned at (0,0) against the y-axis would create a revolution solid, 
namely, a sphere. 


33. 1 . 1 Calculating the volume 

Calculating the volume of this kind of solid is very si mi lar to calculating any VOLUME 1 : we calcu- 
late the basal area, and then we integrate through the height of the volume. 

Say we want to calculate the volume of the shape formed rotating over the x-axis the area contained 
between the curves f(x ) and g(x) in the range [a,b]. First calculate the basal area: 

\nf{x) 2 3 -ng{x) 2 \ 


And then integrate in the range [a,b] : 

r*b r*b 

/ \nf{x) 2 -ng(x) 2 \dx = nl \f[x) 2 -g[x) 2 \dx 

Ja Ja 

Alternatively, if we want to rotate in the y-axis instead, / and g must be invertible in the range [a,b], 
and, following the same logic as before: 

n f I/ -1 (x) 2 - g _1 (x) 2 1 dx 

Ja 

Solid of revolution 2 

3 


1 Chapter 32 on page 215 

2 http ://en. wikipedia. org/wiki/Solid%20of%20revolution 

3 http://en.wikibooks.org/wiki/ Category%3ACalculus%20%28book%2 9 
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34 Arc length 


Suppose that we arc given a function /and we want to calculate the length of the curve drawn out 
by the graph of f. If the graph were a straight line this would be easy — the formula for the length 
of the line is given by Pythagoras’ theorem. And if the graph were a polygon we can calculate the 
length by adding up the length of each piece. 

The problem is that most graphs are not polygons. Nevertheless we can estimate the length of the 
curve by approximating it with straight lines. Suppose the curve C is given by the formula y = /(x) 
for a< x< b. We divide the interval [a, b] into n subintervals with equal width Ax and endpoints 
xo,Xi,...,x„. Now let y, = /(x ; ) so Pi = (x,-, y,) is the point on the curve above x,-. The length of 
the straight line between Pi and P,+] is 


iPi+ tl = \J iyi+i ~ Vi ) 2 + {x i+ i - X;) 2 . 


So an estimate of the length of the curve C is the sum 


n - 1 

E \ p i p i+i\ 

i = 0 

As we divide the interval [a, b ] into more pieces this gives a better estimate for the length of C. In 
fact we make that a definition. 


34.1 Definition (Length of a Curve) 


The length of the curve y = /(x) for a < x < b is defined to be L = lim„_ oo Xlf _ 0 1 \ P,+] P/|. 


34.2 The Arclength Formula 

Suppose that f is continuous on [a, b\. Then the length of the curve given by y = f{x) between a 
and b is given by 


L = 



\Jl + {f'{x)) 2 dx 


And in Leibniz notation 
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f 


L = t/l + 

la 


1 dy } 2 

\dxj 


dx 


Proof: Consider y,+i - y, = /(x, + i) - /(x,). By the Mean Value Theorem 1 there is a point z ; - 
in (x ;+ i,x ; ) such that 

y/+ 1 - yi = f(xi+ 1) - /(xi) = f\zi){xi + 1 - x/) 


So 


IIP^'+il 1= l\/(y/ + i -y,:) 2 + (x ; - + i-x ; -) 2 I- I 1= \\/ {f'{Zi)) 2 {Xi+i - x t ) 2 + (x i+ i -x ; -) 2 I- I 1= 
l^(l + (/'fe)) 2 )(X; + i-X ;) 2 I- I 1= lV(l + (/'(Z;)) 2 )Ax. 1} 

Putting this into the dehniton of the length of C gives 


L = lim Yj \f d + (/'(z,-)) 2 )Ax. 

n^oo V 
z=0 

Now this is the definition of the integral of the function g(x) = \/\ + (fix)) 2 between a and b (notice 
that gis continuous because we are assuming that f is continuous). Hence 


L = 



\Jl + {f'{x)) 2 dx 


as claimed. 


34.3 Arclength of a parametric curve 

For a parametric curve, that is, a curve defined by x = fit) and y = git), the formula is slightly 
different: 


L= C J if it)) 2 + ig’ it)) 2 dt 

J a 

Proof: The proof is analogous to the previous one: Consider y !+ i - y,- = g(f/+i) - g(t,) and x, + i - 
Xi = fitj+i) — fiti). By the Mean Value Theorem there are points c, and d, in (fi+i, f/) such that 


1 Chapter 27 on page 177 
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yi+i-yt = g(t i+ i) - gw = g'(ci)(t i+ i - tj ) 


and 


Xi+i-Xi = f{t i+ i)- f{ti) = f\di)(t i+l - ti ) 


So 


\\PiPi + tl 1= I V (y/+ 1 - T;) 2 3 + (X ;+1 - Xi) 2 I- I 1= \\ / (g'{c i )) 2 (t i+ i - ti) 2 + (f'(dj)) 2 {ti+i - ti) 2 I- I 1= 
lV(/'Wi)) 2 ) + (g'(c/)) 2 )(f/ + i-f ; ) 2 I- I 1= iy(/'W ; )) 2 + (g'(c ; )) 2 Af. 1} 

Putting this into the definiton of the length of the curve gives 


L = lim 


) X \/(/'W/)) 2 + (g'(c/)) 2 Ar. 


i=0 


This is equivalent to: 

L= f J (/'(t)) 2 + ig r {t)) 2 dt 

Ja 

Arc length 2 

3 


2 http ://en. wikipedia.org/wiki/Arc%20length 

3 http://en.wikibooks.org/wiki/ Category%3ACalculus%20%28book%2 9 
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35 Surface area 


SURFACE_OF_REVOLUTION 1 

Suppose we arc given a function / and we want to calculate the surface area of the function / ro- 
tated around a given line. The calculation of surface area of revolution is related to the arc length 
calculation. 

If the function / is a straight line, other methods such as surface area formulas for cylinders and 
conical frustra can be used. However, if /is not linear, an integration technique must be used. 

Recall the formula for the lateral surface area of a conical frustum: 


A = 2nrl 

where r is the average radius and l is the slant height of the frustum. 

For y=f(x) and a < x < b, we divide [a,b] into subintervals with equal width x and endpoints 
XQ,X\,...,x n . We map each point y/ = /(x,-) to a conical frustum of width x and lateral surface 
area Ai . 

We can estimate the surface area of revolution with the sum 


n 

A =Y J A i 

;'=0 

As we divide [a,b] into smaller and smaller pieces, the estimate gives a better value for the surface 
area. 


35.1 Definition (Surface of Revolution) 

The surface area of revolution of the curve y=f(x) about a line for a < x < b is defined to be 
A = lim„_ oo X” =0 Aj 

35.2 The Surface Area Formula 

Suppose /is a continuous function on the interval [a,b] and r(x) represents the distance from f(x) to 
the axis of rotation. Then the lateral surface area of revolution about a line is given by 

1 http://en.wikipedia.org/wiki/ Surf ace_of_re volut i on 
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A = 2nf r{x)J 1 + {f'{x)) 2 dx 
J a v 

And in Leibniz notation 



Proof: 


\A 1= Him^ool 'J =l Ai I- 1 1= Him M I" =1 2 nnk I- I 1= I2wlim„^ 00 lf =1 r,- I- 1 1} 

As n -* oo and Ax — 0, we know two things: 

1 . the average radius of each conical frustum r ; - approaches a single value 

2. the slant height of each conical frustum /,• equals an infitesmal segment of arc length 
From the arc length formula discussed in the previous section, we know that 

li = yJl + (f'(Xi )) 2 

Therefore 


I A 1= I27rlim„^ 00 i;' 7 =1 r,/, I- 1 1= \2n\im. n ^ 00 Y^ =x n\A + (/'(x/)) 2 Ax 1-11} 

Because of the definition of an integral f{x)dx = lim^oo L” =1 /(c,)Ax/, we can simplify the 
sigma operation to an integral. 


A = 2n f r{x)J 1 + {f'{x)) 2 dx 
J a v 

Or if /is in terms of y on the interval [c,d] 

A = 2nj^ r{y)\J \ + {f'{y)) 2 dy 


2 


2 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 


228 



36 Work 


W = f F dr = f madr = f m^dr = mf ^fcdv = mf vdv = jmv 2 = A 
l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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37 Centre of mass 


37.1 Exercises 

See the exercises for Integration 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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38 Parametric Equations 
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39 Introduction 


39.1 Introduction 

Parametric equations are typically definied by two equations that specify both the x and y coordi- 
nates of a graph using a parameter. They arc graphed using the parameter (usually t) to figure out 
both the x and y coordinates. 

Example 1: 

x = t 
y=t 2 

Note: This parametric equation is equivalent to the rectangular equation y = x 2 . 

Example 2: 

X = cos t 
y = sint 

Note: This parametric equation is equivalent to the rectangular equation x 2 + y 2 - 1 and the polar 
equation r = 1 . 

Parametric equations can be plotted by using a t-table to show values of x and y for each value of t. 
They can also be plotted by eliminating the parameter though this method removes the parameter’s 
importance. 


39.2 Forms of Parametric Equations 

Parametric equations can be described in three ways: 

• Parametric form 

• Vector form 

• An equality 

The first two forms arc used far more often, as they allow us to find the value of the component at 
the given value of the parameter. The final form is used less often; it allows us to verify a solution 
to the equation, or find the parameter (or some constant multiple thereof). 


39.2.1 Parametric Form 

A parametric equation can be shown in parametric form by describing it with a system of equations. 
For instance: 
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x= t 
y=t 2 - 1 

39.2.2 Vector Form 

Vector form can be used to describe a parametric equation in a similar manner to parametric form. 
In this case, a position vector is given: [x,y] = [t, t 2 - 1] 

39.2.3 Equalities 

A parametric equation can also be described with a set of equalities. This is done by solving for the 
parameter, and equating the components. For example: 

x= t 

y=t 2 - 1 

From here, we can solve for t: 
t = x 

t = ±yj\ + y 

And hence equate the two right-hand sides: 
x = ±^l + y 

39.3 Converting Parametric Equations 

There are a few common place methods used to change a parametric equation to rectangular form. 
The first involves solving for t in one of the two equations and then replacing the new expression 
for t with the variable found in the second equation. 

Example 1: 

x= t- 3 
y=t 2 

x = t - 3 becomes x + 3 = t 
y = (x + 3) 2 

Example 2: 

Given 
x = 3cos 6 
y = 4sin0 

Isolate the trigonometric functions 
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COS0 = | 

sin0 = | 

Use the "Beloved Identity" 
cos 2 8 + sin 2 6 = 1 
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40 Differentiation 


40.1 Taking Derivatives of Parametric Systems 

Just as we arc able to differentiate functions of x, we arc able to differentiate x and y, which arc 
functions of t. Consider: 

x = sin t 

y=t 

We would find the derivative of x with respect to t, and the derivative of y with respect to t: 
x' = cos t 

y' = 1 

In general, we say that if 
x = f{t) and y = g(f) then: 
x' = f{t) and y' = g'it) 

It’s that simple. 

This process works for any amount of variables. 


40.2 Slope of Parametric Equations 

In the above process, x’ has told us only the rate at which x is changing, not the rate for y, and vice 
versa. Neither is the slope. 

In order to find the slope, we need something of the form 

We can discover a way to do this by simple algebraic manipulation: 

L=3L = dy 

X 1 dx dx 
dt 

So, for the example in section 1, the slope at any time t: 
i^7= sec? 

In order to find a vertical tangent line, set the horizontal change, or x’, equal to 0 and solve. 

In order to find a horizontal tangent line, set the vertical change, or y’, equal to 0 and solve. 

If there is a time when both x’ and y’ arc 0, that point is called a singular point. 
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40.3 Concavity of Parametric Equations 

Solving for the second derivative of a parametric equation can be more complex that it may seem at 
first glance. When you have take the derivative of ^ in terms of t, you arc left with 

, d 2 y 

d r I _ dx 

dt^ dx' dr ' 

By multiplying this expression by ^ , we are able to solve for the second derivative of the parametric 
equation: 

d2 y >9 

y dt _ d z y 

dt dx dx 2 ' 

Thus, the concavity of a parametric equation can be described as: 

d r dy | x dt_ 
dt^dx‘ dx 

So for the example in sections 1 and 2, the concavity at any time t: 

J~ t [esc t ] x cos t=- CSC 2 t X cos t 
1 


1 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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41 Integration 


41.1 Introduction 

Because most parametric equations are given in explicit form, they can be integrated like many other 
equations. Integration has a variety of applications with respect to parametric equations, especially 
in kinematics and vector calculus. 

x = f x'{t)dt 

y = fy'{t)dt 
l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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42 Polar Equations 


introduction 


90 ° 



Figure 49: A polar grid with several angles labeled in degrees 


The polar coordinate system is a two-dimensional coordinate system in which each point on a 
plane is determined by an angle and a distance. The polar coordinate system is especially useful in 
situations where the relationship between two points is most easily expressed in terms of angles and 
distance; in the more familiar Cartesian coordinate system or rectangular coordinate system, such a 
relationship can only be found through trigonometric formulae. 

As the coordinate system is two-dimensional, each point is determined by two polar coordinates: the 
radial coordinate and the angular coordinate. The radial coordinate (usually denoted as r) denotes 
the point’s distance from a central point known as the pole (equivalent to the origin in the Cartesian 
system). The angular coordinate (also known as the polar angle or the azimuth angle, and usually 
denoted by or t) denotes the positive or anticlockwise (counterclockwise) angle required to reach 


243 


Polar Equations 


the point from the 0° ray or polar axis (which is equivalent to the positive x-axis in the Cartesian 
coordinate plane). 


42. 1 Plotting points with polar coordinates 


( 3 , 60 °) 



Figure 50: The points (3,60°) and (4,210°) on a polar coordinate system 


Each point in the polar coordinate system can be described with the two polar coordinates, which are 
usually called r (the radial coordinate) and (the angular coordinate, polar angle, or azimuth angle, 
sometimes represented as or t). The r coordinate represents the radial distance from the pole, and 
the coordinate represents the anticlockwise (counterclockwise) angle from the 0° ray (sometimes 
called the polar axis), known as the positive x-axis on the Cartesian coordinate plane. 

For example, the polar coordinates (3, 60°) would be plotted as a point 3 units from the pole on the 
60° ray. The coordinates (&minus;3, 240°) would also be plotted at this point because a negative 
radial distance is measured as a positive distance on the opposite ray (the ray reflected about the 
origin, which differs from the original ray by 180°). 

One important aspect of the polar coordinate system, not present in the Cartesian coordinate system, 
is that a single point can be expressed with an infinite number of different coordinates. This is 
because any number of multiple revolutions can be made around the central pole without affecting 
the actual location of the point plotted. In general, the point ( r , ) can be represented as (r, ± nx 360°) 
or (r, ± (2n + 1)180°), where n is any integer. 

The arbitrary coordinates (0, ) are conventionally used to represent the pole, as regardless of the 
coordinate, a point with radius 0 will always be on the pole. To get a unique representation of a 
point, it is usual to limit r to negative and non-negative numbers r 0 and to the interval [0, 360°) 
or (180°, 180°] (or, in radian measure, [0, 2) or (, ]). 
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Angles in polar notation are generally expressed in either degrees or radians, using the conversion 2 
rad = 360°. The choice depends largely on the context. Navigation applications use degree measure, 
while some physics applications (specifically rotational mechanics) and almost all mathematical 
literature on calculus use radian measure. 

42. 1 . 1 Converting between polar and Cartesian coordinates 


Figure 5 1 : A diagram illustrating the conversion formulae 

The two polar coordinates r and can be converted to the Cartesian coordinates x and y by using the 
trigonometric functions sine and cosine: 




x = rcos 6 


y = r sind, 


while the two Cartesian coordinates x and y can be converted to polar coordinate r by 



(by a simple application of the Pythagorean theorem). 

To determine the angular coordinate , the following two ideas must be considered: 
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• For r = 0, can be set to any real value. 

• For r 0, to get a unique representation for , it must be limited to an interval of size 2. Conventional 
choices for such an interval arc [0, 2) and (&minus;, ]. 

To obtain in the interval [0, 2), the following may be used (arctan denotes the inverse of the tangent 
function): 


e = < 


arctan(^) 
arctan(^) + 2n 
arctan(^) + n 


n 

2 

3 n 
2 


if x > 0 and y > 0 
if x > 0 and y < 0 
if x< 0 

if x = 0 and y > 0 
if x = 0 and y < 0 


To obtain in the interval (&minus;, ], the following may be used: 


arctan(^) 
arctan(^) + n 


e = { 


arctan(^) -n 


n 

2 


n 

2 


if x > 0 

if x < 0 and y > 0 
if x < 0 and y < 0 
if x = 0 and y > 0 
if x = 0 and y < 0 


One may avoid having to keep track of the numerator and denominator signs by use of the atan2 
function, which has separate arguments for the numerator and the denominator. 


42.2 Polar equations 

The equation defining an algebraic curve expressed in polar coordinates is known as a polar equa- 
tion. In many cases, such an equation can simply be specified by defining r as a function of . The 
resulting curve then consists of points of the form (r(), ) and can be regarded as the graph of the 
polar function r. 

Different forms of symmetry can be deduced from the equation of a polar function r. If r() = r() the 
curve will be symmetrical about the horizontal (07180°) ray, if r() = r() it will be symmetric about 
the vertical (907270°) ray, and if r(°) = r() it will be rotationally symmetric ° counterclockwise 
about the pole. 

Because of the circular nature of the polar coordinate system, many curves can be described by a 
rather simple polar equation, whereas their Cartesian form is much more intricate. Among the best 
known of these curves arc the polar rose, Archimedean spiral, lemniscate, I i matron, and cardioid. 

For the circle, line, and polar rose below, it is understood that there are no restrictions on the domain 
and range of the curve. 
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42.2.1 Circle 



Figure 52: A circle with equation r() = 1 

The general equation for a circle with a center at (ro, ) and radius a is 

r 2 - 2r ro cos{9 - ip) + = a 2 . 

This can be simplified in various ways, to conform to more specific cases, such as the equation 

r{9) = a 
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for a circle with a center at the pole and radius a. 

42.2.2 Line 

Radial lines (those running through the pole) arc represented by the equation 

8 = q> 

where is the angle of elevation of the line; that is, = arctan m where m is the slope of the line in 
the Cartesian coordinate system. The non-radial line that crosses the radial line = perpendicularly 
at the point (ro, ) has the equation 

r{8 ) - rosec(0 - <p). 
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42.2.3 Polar rose 



Figure 53: A polar rose with equation r() = 2 sin 4 


A polar rose is a famous mathematical curve that looks like a petaled flower, and that can be ex- 
pressed as a simple polar equation, 


r{9) = a cos(k9 + (f>o) 

for any constant ipo (including 0). If k is an integer, these equations will produce a /c-pctalcd rose if 
k is odd, or a 2 /c-pctalcd rose if k is even. If k is rational but not an integer, a rose-like shape may 
form but with overlapping petals. Note that these equations never define a rose with 2, 6, 10, 14, 
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etc. petals. The variable a represents the length of the petals of the rose. 


42.2.4 Archimedean spiral 



The Archimedean spiral is a famous spiral that was discovered by Archimedes, which also can be 
expressed as a simple polar equation. It is represented by the equation 


r{9) = a + bd. 
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Changing the parameter a will turn the spiral, while b controls the distance between the arms, 
which for a given spiral is always constant. The Archimedean spiral has two arms, one for > 0 
and one for < 0. The two arms are smoothly connected at the pole. Taking the mirror image of 
one arm across the 907270° line will yield the other arm. This curve is notable as one of the first 
curves, after the CONIC SECTIONS 1 , to be described in a mathematical treatise, and as being a 
prime example of a curve that is best defined by a polar equation. 


42.2.5 Conic sections 



Figure 55: Ellipse, showing semi-latus rectum 


A conic section with one focus on the pole and the other somewhere on the 0° ray (so that the conic’s 
semi-major axis lies along the polar axis) is given by: 

£ 

r = 

1 + ecos 6 

where e is the eccentricity and £ is the semi-latus rectum (the perpendicular distance at a focus from 
the major axis to the curve). Ife> 1 . this equation defines a hyperbola; i f e = 1, it defines a parabola; 
and if e < 1, it defines an ellipse. The special case e = 0 of the latter results in a circle of radius £. 


1 http://en.wikibooks.org/wiki/Conic%20Sections 
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43 Differentiation 


43.0.6 Differential calculus 

We have the following formulas: 


r i = x Tx + yi~y 


— - -V—+X — 

de - y dx ' x dy' 


To find the Cartesian slope of the tangent line to a polar curve rQ at any given point, the curve is 
first expressed as a system of parametric equations. 


x = r(0) cos0 


y = r(0)sin0 

Differentiating both equations with respect to yields 

|| = r'(0) cos 0 - r(0 ) sin0 
^ = r'{6) sin0 + r{6) cosd 

Dividing the second equation by the first yields the Cartesian slope of the tangent line to the curve 
at the point (r, rQ): 


dy _ r'{9) sin0 + r(0) cosd 
dx r' (0) cos 0 - r (0) sin 0 


l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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44 Integration 


44.1 Introduction 

Integrating a polar equation requires a different approach than integration under the Cartesian sys- 
tem, hence yielding a different formula, which is not as straightforward as integrating the function 

fM- 


44.2 Proof 

In creating the concept of integration, we used Riemann sums of rectangles to approximate the area 
under the curve. However, with polar graphs, one can use sectors of circles with radius r and angle 
measure d. The area of each sector is then (r 2 )(d/2) and the sum of all the infinitesimally small 
sectors’ areas is r 2 dO, </ux/u> This is the form to use to integrate a polar expression of the 
form r = /(0) where (a, f(a)) and (b,f(b)) are the ends of the curve that you wish to integrate. 


44.2.1 Integral calculus 



0 = a 

Figure 56: The integration region R is bounded by the curve r = f(0) and the rays 0 = a and 0 = b. 
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Let R denote the region enclosed by a curve r = f (0) and the rays 6 = a and 0 = b, where 0 < b— a < 
2 n. Then, the area of R is 


Figure 57: The region R is approximated by n sectors (here, n = 5). 

This result can be found as follows. First, the interval [a, b] is divided into n subintervals, where 
n is an arbitrary positive integer. Thus 9, the length of each subinterval, is equal to b - a (the total 
length of the interval), divided by n, the number of subintervals. For each subinterval i = 1,2 
let Oi be the midpoint of the subinterval, and construct a circular sector with the center at the origin, 
radius r, = f{9i), central angle 80, and arc length r,-50. The area of each constructed sector is 
therefore equal to ^rf59. Hence, the total area of all of the sectors is 


As the number of subintervals n is increased, the approximation of the area continues to improve. 
In the limit as n — oo, the sum becomes the Riemann integral. 

Generalization 

Using Cartesian coordinates, an infinitesimal area element can be calculated as dA = dxdy. The 
substitution rule for multiple integrals states that, when using other coordinates, the Jacobian deter- 
minant of the coordinate conversion formula has to be considered: 




0 


'• 6 = b 



n 



256 


Applications 


/ = det 


d(x, y) 
dir, 6) 


dx 

dx 


dr 

dB 


dy 

dy 


dr 

dd 



cosd -rsind 
sind rcosd 


= r cos 2 9 + r sin 2 9 -r. 


Hence, an area element in polar coordinates can be written as 

dA = Jdrd9 = rdrd9. 

Now, a function that is given in polar coordinates can be integrated as follows: 

•» b rr{6) 


rr ro nrw) 

// g(r,d) dA = / / g{r,9)r dr d9. 

JJR Ja Jo 


Here, R is the same region as above, namely, the region enclosed by a curve r = /(d) and the rays 
9 = a and 9 = b. 


The formula for the area of R mentioned above is retrieved by taking g identically equal to 1 . 


44.3 Applications 

Polar integration is often useful when the corresponding integral is either difficult or impossible to 
do with the Cartesian coordinates. For example, let’s try to find the area of the closed unit circle. 
That is, the area of the region enclosed by x 2 + y 2 = 1. 


In Cartesian 




■x 2 dx 


In order to evaluate this, one usually uses trigonometric substitution. By setting sind = x, we get 
both cosd = Vl-x 2 and cosd d9 = dx. 


j V l- x 2 dx = J cos 2 9d9 


r l l 

= / - + -cos2 9d9 

J 2 2 

9 1 9 1 

= - + - sin2 9 + c= - + - sind cosd + c 
2 4 2 2 


arcsinx 


xVT- 


■ x 


+ c 


Putting this back into the equation, we get 


f / 7 , 

arcsinx xVl-x 2 

I v 1 - x 2 dx = 2 

+ 

J- 1 

2 2 


= arcsin 1 - arcsin(- 1) = n 


-l 
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In Polar 

To integrate in polar coordinates, we first realize r = \J x 2 + y 2 = \f\ = 1 and in order to include the 
whole circle, a - 0 and b = 2n. 



r dr d6 = 



r 2 

2~ 


r 2n 1 

dO = \ -dd = 
o Jo 2 


6 

2 


2n 

0 



44.3.1 An interesting example 


A less intuitive application of polar integration yields the Gaussian integral 



dx = \fn. 


Try it! (Hint: multiply f^e x ~ dx and /'^ e r dy.) 
l 


1 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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46 Sequences 


A sequence is an ordered list of objects (or events). Like a set, it contains members (also called 
elements or terms), and the number of terms (possibly infinite) is called the length of the sequence. 
Unlike a set, order matters, and exactly the same elements can appear multiple times at different 
positions in the sequence. 

For example, (C, R, Y) is a sequence of letters that differs from (Y, C, R), as the ordering matters. 
Sequences can be finite, as in this example, or infinite, such as the sequence of all even positive 
integers (2, 4, 6,...). 


X 

n 



Figure 58: An infinite sequence of real numbers (in blue). This sequence is neither increasing, nor 
decreasing, nor convergent. It is however bounded. 


46.1 Examples and notation 


There are various and quite different notions of sequences in mathematics, some of which (e.g., 
exact sequence) arc not covered by the notations introduced below. 

A sequence may be denoted (a\, cii, ...). For shortness, the notation (a n ) is also used. 

A more formal definition of a finite sequence with terms in a set S is a function from {1, 2, ..., 
n} to S for some n 0. An infinite sequence in S is a function from {1, 2, ...} (the set of natural 
numbers without 0) to S. 

Sequences may also start from 0, so the first term in the sequence is then oq. 
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A finite sequence is also called an n-tuple. Finite sequences include the empty sequence ( ) that has 
no elements. 

A function from all integers into a set is sometimes called a bi-infinite sequence, since it may be 
thought of as a sequence indexed by negative integers grafted onto a sequence indexed by positive 
integers. 


46.2 Types and properties of sequences 


A subsequence of a given sequence is a sequence formed from the given sequence by deleting some 
of the elements (which, as stated in the introduction, can also be called "terms") without disturbing 
the relative positions of the remaining elements. 

If the terms of the sequence are a subset of an ordered set, then a monotonically increasing sequence 
is one for which each term is greater than or equal to the term before it; if each term is strictly greater 
than the one preceding it, the sequence is called strictly monotonically increasing. A monotonically 
decreasing sequence is defined similarly. Any sequence fulfilling the monotonicity property is called 
monotonic or monotone. This is a special case of the more general notion of monotonic function. 

The terms non- decreasing and non-increasing are used in order to avoid any possible confusion 
with strictly increasing and strictly decreasing, respectively. If the terms of a sequence are integers, 
then the sequence is an integer sequence. If the terms of a sequence are polynomials, then the 
sequence is a polynomial sequence. 

If S is endowed with a topology, then it becomes possible to consider convergence of an infinite 
sequence in S. Such considerations involve the concept of the limit of a sequence. 


46.3 Sequences in analysis 


In analysis, when talking about sequences, one will generally consider sequences of the form 


(Xi,x 2 ,x 3 ,...) 


or 

(X 0 ,Xi,X 2 ,...) 

which is to say, infinite sequences of elements indexed by natural numbers. (It may be convenient 
to have the sequence start with an index different from 1 or 0. For example, the sequence defined 
by x n = l/log(n) would be defined only for n> 2. When talking about such infinite sequences, it is 
usually sufficient (and does not change much for most considerations) to assume that the members 
of the sequence are defined at least for all indices large enough, that is, greater than some given IV.) 

The most elementary type of sequences are numerical ones, that is, sequences of real or complex 
numbers. 


262 



47 Series 


47.1 Introduction 

A series is the sum of a sequence of terms. An infinite series is the sum of an infinite number of 
terms (the actual sum of the series need not be infinite, as we will see below). 

An arithmetic series is the sum of a sequence of terms with a common difference (the difference 
between consecutive terms). For example: 

1+4 + 7+10+13 + .. . 

is an arithmetic series with common difference 3, since a 2 — a\ = 3, a 2 - a 2 = 3, and so forth. 

A geometric series is the sum of terms with a common ratio. For example, an interesting series 
which appeal's in many practical problems in science, engineering, and mathematics is the geometric 
series r + r 2 + r 3 + r 4 + ... where the ... indicates that the series continues indefinitely. A common 
way to study a particular' series (following Cauchy) is to define a sequence consisting of the sum of 
the first n terms. For example, to study the geometric series we can consider the sequence which 
adds together the first n terms: 


S n (r ) = Y, r ‘- 

i=t 

Generally by studying the sequence of partial sums we can understand the behavior of the entire 
infinite series. 

Two of the most important questions about a series are 

• Does it converge? 

• If so, what does it converge to? 

For example, it is fairly easy to see that for r > 1, the geometric series S n (r) will not converge to 
a finite number (i.e., it will diverge to infinity). To see this, note that each time we increase the 
number of terms in the series, S n [r ) increases by r” +1 , since r ' l+1 > 1 for all r > 1 (as we defined), 
S„(r) must increase by a number greater than one every term. When increasing the sum by more 
than one for every term, it will diverge. 

Perhaps a more surprising and interesting fact is that for |r| < 1, S n (r) will converge to a finite value. 
Specifically, it is possible to show that 


r 

lim S„(r) = . 

n—-oo 1 — T 
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Indeed, consider the quantity 


n n n + 1 

(1 - r)S n [r) = (1 - r) ^ r n - £ r n - £ r n = r-r n+1 

i= 1 i = 1 i=2 

Since r” +1 — 0 as n — > • oo for \r\ < 1, this shows that (1 - r)S n (r ) — r as n — ► oo. The quantity 1 - r 
is non-zero and doesn’t depend on n so we can divide by it and arrive at the formula we want. 

We’d like to be able to draw similar conclusions about any series. 

Unfortunately, there is no simple way to sum a series. The most we will be able to do in most cases 
is determine if it converges. The geometric and the telescoping series arc the only types of series in 
which we can easily find the sum of. 


47.2 Convergence 


It is obvious that for a series to converge, the a n must tend to zero (because sum of any infinite 
terms is infinity, except when the sequence approaches 0), but even if the limit of the sequence is 0, 
is not sufficient to say it converges. 

Consider the harmonic series, the sum of 1 In, and group terms 


V 2 m I 
4-1 n 


~ l + \ + 
>3 + 

2 ' 

= 3 + 

2 ' 


1 + 1 + 1+ 1 + 1 + i + 

3 ^ 4 t 56 t 78 t 

b 


b 


+ Y 2 " I 

z -l+2"- 1 p 


+ - 


+—2 n ~ l 

t 2" 

(m terms) 


As m tends to infinity, so does this final sum, hence the series diverges. 

We can also deduce something about how quickly it diverges. Using the same grouping of terms, 
we can get an upper limit on the sum of the first so many terms, the partial sums. 


or 


m 1 

1 h <) — < 1 + m 

2 Y n 


ln 2 m ^ 1 

1 h < ) — < 1 + In 2 m 

2 Y n 


and the partial sums increase like log m, very slowly. 

Notice that to discover this, we compared the terms of the harmonic series with a series we knew 
diverged. 

This is a convergence test (also known as the direct comparison test) we can apply to any pair of 
series. 

• If b n converges and \a n \<\b n \ then a n converges. 
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• If b n diverges and \a n \>\b n \ then a n diverges. 

There are many such tests, the most important of which we’ll describe in this chapter. 


47.2.1 Absolute convergence 

Theorem: If the series of absolute values, I a n | , converges, then so does the series a n 

We say such a series converges absolutely. 

Proof: 

Let e > 0 

According to the Cauchy criterion for series convergence, exists N so that for all N < m, nr. 

We know that: 

\r k i n a k \<r k n =n \a k \ 

And then we get: 

\r k l n a k \<r k n =n \a k \<e 

Now we get: 

II T=n a k\<e 

Which is exactly the Cauchy criterion for series convergence. 

Q.E.D 

The converse does not hold. The series 1-1/2+ 1/3- 1/4 ... converges, even though the series of its 
absolute values diverges. 

A series like this that converges, but not absolutely, is said to converge conditionally. 

If a series converges absolutely, we can add terms in any order we like. The limit will still be the 
same. 

If a series converges conditionally, rearranging the terms changes the limit. In fact, we can make 
the series converge to any limit we like by choosing a suitable rearrangement. 

E.g., in the series 1-1/2+ 1/3- 1/4 ..., we can add only positive terms until the partial sum exceeds 
100, subtract 1/2, add only positive terms until the partial sum exceeds 100, subtract 1/4, and so on, 
getting a sequence with the same terms that converges to 100. 

This makes absolutely convergent series easier to work with. Thus, all but one of convergence tests 
in this chapter will be for series all of whose terms arc positive, which must be absolutely convergent 
or divergent series. Other series will be studied by considering the corresponding series of absolute 
values. 
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47.2.2 Ratio test 

For a series with terms a n , if 


lim 

re— » oo 


^re+1 

d n 


= r 


then 

• the series converges (absolutely) if r< 1 

• the series diverges if r> 1 (or if r is infinity) 

• the series could do either if r=l, so the test is not conclusive in this case. 
E.g., suppose 


ci n — 


n\n\ 
(2 n)\ 


then 


a n+ 1 _ [n+ l) 2 _ n + 1 1 

a n (2n+l)(2n + 2) An + 2 4 


so this series converges. 


47.2.3 Integral test 

If j[x) is a monotonically decreasing, always positive function, then the series 


OO 


L 

n - 1 


converges if and only if the integral 

OO 

f{x)dx 

converges. 

E.g., consider fx)=l/x p , for a fixed p. 

• If p=l this is the harmonic series, which diverges. 

• I f' jX I each term is larger than the harmonic series, so it diverges. 

• If p>\ then 


fi°x p dx 


lim^ooZ/x p dx 

-l 5 
lim^oo (p _ ])xP , i 

lim^oo ^ (p_i) s p-ij — p-i 
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The integral converges, for p> 1, so the series converges. 
We can prove this test works by writing the integral as 


roo oo rn+1 

/ f{x)dx=Y^ / f{x)dx 
J1 n— 

and comparing each of the integrals with rectangles, giving the inequalities 

pn + 1 

/(«)>/ f(x)dx>f{n+ 1 ) 

J n 

Applying these to the sum then shows convergence. 


47.2.4 Limit comparison test 

Given an infinite series Y a n with positive terms only, if one can find another infinite series Y b n 
with positive terms for which 


n *oo b n 

for a positive and finite L (i.e., the limit exists and is not zero), then the two series either both 
converge or both diverge. That is, 

• Y a u converges if Y b„ converges, and 

• Y a n diverges if Y b n diverges. 

Example: 


_ M+l 
Clyi — Yl n 

For large n, the terms of this series are similar to, but smaller than, those of the harmonic series. We 
compare the limits. 


lim^ = lim- 
b)% 


II n 


= lim 


n + 1 

n n 


= lim — - = 1 > 0 

nn 


so this series diverges. 


47.2.5 Alternating series 

Given an infinite series Y a n , if the signs of the a D alternate, that is if 


&n — ( 1)1 I 
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for all n or 


a n = (-l) n+1 \a n \ 

for all n, then we call it an alternating series. 

The alternating series test states that such a series converges if 


lim a„ = 0 

n — *oo 


and 


I a n+l I ^ I I 

(that is, the magnitude of the terms is decreasing). 

Note that this test cannot lead to the conclusion that the series diverges; if one cannot conclude that 
the series converges, this test is inconclusive, although other tests may, of course, be used to give a 
conclusion. 


Estimating the sum of an alternating series 

The absolute error that results in using a partial sum of an alternating series to estimate the final 
sum of the infinite series is smaller than the magnitude of the first omitted term. 


y &n y &n 
n=l n= 1 




47.3 Geometric series 


The geometric series can take either of the following forms 


or 


L ar n 

n = 0 
oo 

L ar n ~ l 

n - 1 


As you have seen at the start, the sum of the geometric series is 


Sn = 


1 - r 


for Irl < 1 
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47.4 Telescoping series 


oo 


Y (b n 

n = o 


h«+ 1) 


Expanding (or "telescoping") this type of series is informative. If we expand this series, we get: 
k 

Y - b n+i> = (&o - bi) + (hi - h 2 ) + ... + (hfc-1 - b k ) 

n = 0 


Additive cancellation leaves: 


k 


Yib n -b n+ i) = b 0 -b k 

n = 0 


Thus, 


Y ( bn 

n = 0 


n) = lim Y ( b n 

k—oo n =o 


b n +i) = lim (h 0 - b k ) = h 0 - lim 

k—> oo k—*oo 


b k 


and all that remains is to evaluate the limit. 

There are other tests that can be used, but these tests are sufficient for all commonly encountered 
series. 

l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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49 Taylor Series 

49.1 Taylor Series 



Figure 59 


sin(x) and Taylor approximations, polynomials of degree 1, 3, 5, 7,9, 11 and 13. 

The Taylor series of an infinitely often DIFFERENTIABLE 1 real (or complex) FUNCTION 2 /defined 
on an OPEN INTERVAL 3 ( a-r , a+r) is the POWER SERIES 4 „, ta) ( x-a) n 

Flere, nl is the factorial'’ of n and/ 1 ”/ a) denotes the nth DERIVATIVE 6 of /at the point a. If this 
series converges for every x in the interval (a-r, a+r) and the sum is equal to fix), then the function 
fix) is called analytic. To check whether the series converges towards fix), one normally uses 
estimates for the remainder term of Taylor’S theorem 7 . A function is analytic if and only if a 


1 

2 

3 

4 

5 

6 
7 


http://en.wikipedia.org/wiki/differentiable 

http://en.wikipedia.org/wiki/function%20%28mathematics%29 

http://en.wikipedia.org/wiki/interval%20%28mathematics%29 

http://en.wikipedia.org/wiki/power%20series 

http://en.wikipedia.org/wiki/Factorial 

http://en.wikipedia.org/wiki/Derivative 

HTTP://eN. WIKIPEDIA. 0 RG / W I K I /T A Y L 0 R%2 7 S_T H E OREM 
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POWER SERIES 8 converges to the function; the coefficients in that power series are then necessarily 
the ones given in the above Taylor series formula. 

If a = 0, the series is also called a Maclaurin series. 

The importance of such a power series representation is threefold. First, differentiation and inte- 
gration of power series can be performed term by term and is hence particularly easy. Second, 
an analytic function can be uniquely extended to a HOLOMORPHIC FUNCTION 9 defined on an open 
disk in the COMPLEX PLANE 10 , which makes the whole machinery of COMPLEX ANALYSIS 11 avail- 
able. Third, the (truncated) series can be used to approximate values of the function near the point 
of expansion. 



Figure 60 


The function e 11x2 is not analytic: the Taylor series is 0, although the function is not. 

Note that there are examples of INFINITELY OFTEN DIFFERENTIABLE FUNCTION 12 s j{x) whose 
Taylor series converge, but are nor equal to j{x). For instance, for the function defined piecewise by 
saying that J{x) = exp(&minus; Mx 1 ) i f x f 0 and fiO) = 0, all the derivatives are zero at x = 0, so 
the Taylor series of fx) is zero, and its RADIUS OF CONVERGENCE 13 is infinite, even though the 
function most definitely is not zero. This particular pathology does not afflict COMPLEX 14 -valued 
functions of a complex variable. Notice that cxp(&minus; l/z 2 ) does not approach 0 as z approaches 
0 along the imaginary axis. 

Some functions cannot be written as Taylor series because they have a SINGULARITY 15 ; in these 
cases, one can often still achieve a series expansion if one allows also negative powers of the variable 
x; see LAURENT series 16 . For example, fix) = exp(&minus; l/x 2 ) can be written as a Laurent series. 


8 

9 

10 
11 
12 

13 

14 

15 

16 


http ://en. wikipedia.org/wiki /Power_seri e s 
http://en.wikipedia.org/wiki/Holomorphic%20function 
http://en.wikipedia.org/wiki /Comp lex_n umber 
http://en.wikipedia.org/wiki/Complex%20analysis 

http://eN.WIKIPEDIA.ORG/wIKI/INF I NI TEL Y_0F TEN_D IFFERENT I ABLE_FUNCT I ON 
http://en.wikipedia.org/wiki/Rad I US_OF_CONVERGENCE 
http://en.wikipedia.org/wiki /Comp lex_n umber 

HTTP ://EN. WIKIPEDIA.ORG/wiK I /S INGULARI T Y_%2 8 M AT H E M AT I C S%2 9 
http://en.wikipedia.org/wiki /Laurent_serie s 
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The PARKER-SOCKACKI THEOREM 17 is a recent advance in finding Taylor series which arc solu- 
tions to DIFFERENTIAL EQUATIONS 18 . This theorem is an expansion on the PICARD ITERATION 19 . 

49. 1 . 1 Derivation/ why this works 

If a function f(x) is written as a infinite power series, it will look like this: 

f(x)=co(x-a) 0 +ci(x-a) 1 +C2(x-a) 2 +C3(x-a) 3 +C4(x-a) 4 +C5(x-a) 5 +C6(x-a) 6 +C7(x-a) 7 +... 

where a is half the radius of convergence and co,ci, 02,03,04... arc coefficients. If we substitute a for 
x: 

f(a)=c 0 

If we differentiate: 

f(x)=lci(x-a) 0 +2c2(x-a) 1 +3c3(x-a) 2 +4c4(x-a) 3 +5c5(x-a) 4 +6c6(x-a) 5 +7c7(x-a) 6 +... 

If we substitute a for x: 

f'(a)=lci 

If we differentiate: 

f"(x)=2o2+3*2*C3(x-a) 1 +4*3*C4(x-a) 2 +5*4*C5(x-a) 3 +6*5*C6(x-a) 4 +7*6*C7(x-a) 5 +... 

If we substitute a for x: 
f"(a)=2c 2 
Extrapolating: 
nlc^fXa) 

where f°(x)=f(x) and f 1 (x)=P(x) and so on. We can actually go ahead and say that the power approx- 
imation of f(x) is: 

f(x)=S n= o &#165; ((f n (a)/n!)*(x-a) n ) 

<this needs to be improved> 

49. 1 .2 List of Taylor series 

Several important Taylor series expansions follow. All these expansions arc also valid for complex 
arguments x. 

Exponential function 20 and natural logarithm 21 : 


17 

18 

19 

20 
21 


http://wWW.MATH.JMU.EDU/~OjIM/pICARD. HTML 
HTTP ://EN. WIKIPEDIA. ORG/w I K I / D I F F E RE N T I AL_E QU AT IONS 
HTTP ://EN. WIKIPEDIA. ORG/W I K I /P I C ARD_I T E R AT ION 
HTTP ://EN. WIKIPEDIA. ORG/W I K I /EX P ONE N T I AL_F U N C T ION 
HTTP ://EN. WIKIPEDIA. ORG/W I K I /NA T U RAL_L 0 G AR I T HM 
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= Y — for all x 


n = 0 n - 


Geometric series 22 : 


ln(l + x)=Y x” for 1^1 < 1 


n - 1 


— = Y x " for w < 1 

1 — x n = 0 


Binomial series 23 : 


(1 + x) a = Y C(a, /*) x ” f° r a H l x l < 1 and all complex a 
n = 0 


Trigonometric function 24 s: 


(-D" 2n+l 


sinx= Y 

„=o(2n+l)! 


x z " +i for all x 


(-D" J2n 


cosx= V X for all x 


B 2n (- 4) ”(1-4") 


tanx = V 

ti (2/i)! 


x 2 ” 1 for |x| < — 


C - l )”£2 


secx = y 

( 2 «) ! 


n 2n 


7T 


x for |x| < — 
2 


arcsinx= y - „ 

,^ 0 4"(rc!) 2 (2rc+l) 


{ J ‘" x 2n+1 for |x| < 1 


f— 11” 

arctanx- y x 2 " +1 for |x| < 1 

^ O _L 1 


77=0 


2^2 + 1 


Hyperbolic function 25 s: 


sinhx = y 


o (2n + 1)! 


-x 2 " +1 for all x 


22 http://en.wikipedia.org/wiki/Geometri c_seri e s 

23 http://en.wikipedia.org/wiki /Bi nomi al_theorem 

24 http://en.wikipedia.org/wiki /Tri gonome tri c_func t ion 

25 http://en.wikipedia.org/wiki/Hyperboli c_funct ion 


276 



Taylor Series 


coshx= V — - — x 2n for all x 

B 2n 4 n {4 n -1) 2n—l f , , * 

tanhx= ) x for x < — 

„=i (2/7)! 2 


sinh 3 x = y 


(— l)”(2rc)! , B+1 


o 4"(n!) z (2n + 1) 


x z " +i for |x| < 1 


tanh : x= y — - — x 2n+1 for |x| < 1 

„= 0 2n + l 


<P> 

Lambert’s W function 26 : 


w 0 {x)= y 

n = 1 


(-n) 


n-1 


ft! 


for | x | < — 
e 


The numbers B ^ appealing in the expansions of tan(x) and tanh(x) arc the Bernoulli numbers 27 . 
The C (a,n) in the binomial expansion arc the BINOMIAL COEFFICIENTS 28 . The in the expansion 
of sec(x) are Euler numbers 29 . 


49.1.3 Multiple dimensions 

The Taylor series may be generalized to functions of more than one variable with 


E ••• E 

Hi —0 n d = 0 


d>h 

dx" 1 


d Hd f(a ir --,a d ) 
dx ,ld ni'.---n d \ 


(xi - ai) ni ■ ■ ■ {x d - a d ) nd 


49.1.4 History 

The Taylor series is named for mathematician Brook Taylor 30 , who first published the power 
series formula in 1715. 


26 

27 

28 

29 

30 


HTTP ://EN. WIKIPEDIA. ORG/w I K I / L AMB E R T %2 7 S_W_F UN C T I ON 

http://en.wikipedia.org/wiki/Bernoulli_numbers 

HTTP ://EN. WIKIPEDIA. ORG/w I K I /B I N OM I AL_C 0 E F F I C I ENT 
HTTP ://EN. WIKIPEDIA. ORG/w IKl/EULER_NUMBERS 

http://en.wikipedia.org/wiki/Brook%20Taylor 
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49. 1.5 Constructing a Taylor Series 

Several methods exist for the calculation of Taylor series of a large number of functions. One can 
attempt to use the Taylor series as-is and generalize the form of the coefficients, or one can use 
manipulations such as substitution, multiplication or division, addition or subtraction of standard 
Taylor series (such as those above) to construct the Taylor series of a function, by virtue of Taylor 
series being power series. In some cases, one can also derive the Taylor series by repeatedly apply- 
ing INTEGRATION BY PARTS 31 . The use of COMPUTER ALGEBRA SYSTEMS 32 to calculate Taylor 
series is common, since it eliminates tedious substitution and manipulation. 


Example 1 

Consider the function 


fix) = ln(l + cosx), 


for which we want a Taylor series at 0. 
We have for the natural logarithm 


ln(l + x) 


E 


(-D 


n + 1 



for | x | < 1 


and for the cosine function 


cosx = 


E 

n = 0 


(2 n)\ 


2 4 

XT X 

1 1 • • 

2! 4! 


for all x e C. 


We can simply substitute the second series into the first. Doing so gives 




X 2 X 4 

1 1 

2! 4! 


X Z X 4 ) 

r i 


X 2 

X 4 

1 1 * * * 1 


1 

— 

H • • 

2! 4! j 

1 + 3 


2! 

4! 


Expanding by using MULTINOMIAL COEFFICIENT 3 -^ gives the required Taylor series. Note that 
cosine and therefore / arc even functions, meaning that fix ) = /(-x), hence the coefficients of the 
odd powers x, x 3 , x 5 , x 7 and so on have to be zero and don’t need to be calculated. The first few 
terms of the series are 


2 4 6 

ln(l + cosx) = ln2 

4 96 1440 


17x b 


31x 


10 


322560 7257600 


The general coefficient can be represented using FaA Dl Bruno’S formula 34 . However, this 
representation does not seem to be particularly illuminating and is therefore omitted here. 


31 http://en.wikipedia.org/wiki/Integrat I on_b y_p art s 

32 http://en.wikipedia.org/wiki /Computer_algebra_s y stem 

33 http://en.wikipedia.org/wiki /Mult inomi al_coeff i c i ent 

34 http://en.wikipedia.org/wiki /Fa%E0_d i_Bruno%2 7 s_formula 
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Example 2 

Suppose we want the Taylor series at 0 of the function 


g(*) = 


cosx 


We have for the exponential function 


and, as in the first example, 


Assume the power series is 


oo Y n r 2 r 3 r 4 

Y « i A A A A 

e = ) — = 1 + X H H + — H 

to n\ 2! 3! 4! 


2 4 

X X 

COS X — 1 H 

2! 4! 


= Co + CiX + C2X 2 + C3X 3 H 


COSX 


Then multiplication with the denominator and substitution of the series of the cosine yields 


e x = (co + cix+ C2X 2 + C3X 3 H — )cosx 
= (co + Cl X + C2X 2 + C3X 3 + C4X 4 H ) 


2 4 

X X 

1 1 • • 

2! 4! 


Co 2 c 0 4 Cl 3 Cl 5 2 c 2 4 C2 g 3 C3 5 C3 7 

= Co X H X +C1X X H X +C2X X H X + C3X X H X +• 

2 4! 2 4! 2 4! 2 4! 

Collecting the terms up to fourth order yields 


= c 0 


Co) f Cl 3 Co C2 

+ C1X+ C2 X + C3 X + C4 H 

2) 2) 1 4 2) 


Co C 2 1 4 

x 4 + — 


Comparing coefficients with the above series of the exponential function yields the desired Taylor 
series 


e x 2 2x 3 x 4 

— 1 + X + X H 1 h • 

cosx 3 2 
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49.2 Convergence 

49.3 Generalized Mean Value Theorem 

35 


35 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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50 Power Series 


The study of power series is aimed at investigating series which can approximate some function 
over a certain interval. 


50.1 Motivations 

Elementary calculus (DIFFERENTIATION 1 ) is used to obtain information on a line which touches a 
curve at one point (i.e. a tangent). This is done by calculating the gradient, or slope of the curve, 
at a single point. However, this does not provide us with reliable information on the curve’s actual 
value at given points in a wider interval. This is where the concept of power series becomes useful. 


50.1.1 An example 


Consider the curve of y= cos(x), about the point x= 0. A naive approximation would be the line y= 
1 . However, for a more accurate approximation, observe that cos(x) looks like an inverted parabola 
around x = 0 - therefore, we might think about which parabola could approximate the shape of 
cos(x) near this point. This curve might well come to mind: 



In fact, this is the best estimate for cos(x) which uses polynomials of degree 2 (i.e. a highest 
term of x 2 ) - but how do we know this is true? This is the study of power series: finding optimal 
approximations to functions using polynomials. 


50.2 Definition 

A power series is a SERIES 2 of the form 
ao* 0 + a\x l + ... + flnx" 
or, equivalently, 


£ a i xl 
j = o 


1 Chapter 1 . 2 on page 2 

2 Chapter 47 on page 263 


281 



Power Series 


50.3 Radius of convergence 

When using a power series as an alternative method of calculating a function’s value, the equation 

n 

fix) = ^ a,jX J 

7=0 

can only be used to study fix) where the power series converges - this may happen for a finite range, 
or for all REAL NUMBERS 3 . 

The size of the interval (around its center) in which the power series converges to the function is 
known as the radius of convergence. 

50.3.1 An example 


1 OO 

= £ x " 

1 x n = 0 

(a geometric series) 

this converges when I x I < 1, the range - l<x<+l,so the radius of convergence - centered at 0 - is 
1. It should also be observed that at the extremities of the radius, that is where x = 1 and x= -1, the 
power series does not converge. 

50.3.2 Another example 


eX = H —j 

n=0 n\ 


Using the RATIO TEST 4 , this series converges when the ratio of successive terms is less than one: 


lim 

72—00 


x ( " +1) n\ 
[n + 1)! x n 


< 1 


or 


lim 

72—00 


x n x 1 n\ 


n\ in + 1) x n 


lim 

n—>oo 


n + 1 


< 1 


< 1 


which is always true - therefore, this power series has an infinite radius of convergence. In effect, 
this means that the power series can always be used as a valid alternative to the original function. 


3 http://en.wikipedia.org/wiki /Re al_n umber 

4 Chapter 47.2.2 on page 266 
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50.3.3 Abstraction 


If we use the ratio test on an arbitrary power series, we find it converges when 


.. \dn+lX\ 

lim < I 


and diverges when 


lim > I 


The radius of convergence is therefore 


r = lim 


I d-n I 

I dn + 1 1 


If this limit diverges to infinity, the series has an infinite radius of convergence. 


50.4 Differentiation and Integration 

Within its radius of convergence, a power series can be differentiated and integrated term by term. 

d 00 ■ 00 

— Y a i xJ = YU + Dflj+i^- 7 ' 

ax 7=0 7=0 



xi 


Both the differential and the integral have the same radius of convergence as the original series. 
This allows us to sum exactly suitable power series. For example, 


1 p o 

= \- x + x - X + ... 

1 + X 

This is a geometric series, which converges for I x I < 1. Integrating both sides, we get 


2 3 

ln(l + x) = x - — + — . 

2 3 

which will also converge for I x I < 1 . When x = - 1 this is the harmonic series, which diverges’; when 
x- 1 this is an alternating series with diminishing terms, which converges to In 2 - this is testing the 
extremities. 

It also lets us write power series for integrals we cannot do exactly such as the error function: 


283 



Power Series 


e 


E(- D" 


«! 


The left hand side can not be integrated exactly, but the right hand side can be. 


f z . x 2 , — (-1 ) n z 2n+1 

I e x dx=) 

Jo ^ (2«+l)n! 

This gives us a power series for the sum, which has an infinite radius of convergence, letting us 
approximate the integral as closely as we like. 


50.5 Further reading 

• "Decoding the Rosetta Stone " 5 article by Jack W. Crenshaw 2005-10-12 
6 


50.6 Exercises 


1. Assume that the n th partial sum of a SERIES 7 is given by s u = 2 - 

a) Does the series converge? If so, to what value? 

b) What is the formula for the n th term of the series? 


Find the value to which each the following series converges: 

a) 



c) 


d) 


OO 


L 


i 


n 2 - n 


OO 


E 


(-l) n 2 n ~ 1 

3” 


Determine whether each the following series converges or diverges: 


5 HTTP ://EMBEDDED. com/ SHOWARI I CLE ,JHTML?ARTICLE lD=172300631 

6 http://en.wikibooks.org/wiki /Cate gory%3ACalculus%2 0%2 8book%2 9 

7 http://en.wikibooks.org/wiki/Calculus%2FInfinite%20series 
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a) 


OO 


E 


i 


n 


2 


b) 


OO 


E 

n=0 


1 

2/‘ 


c) 


OO 


E 

n - 1 


n 

n 2 + 1 


d) 


E 


n=2 


1 

Inn 


e) 


OO 


E 

n=0 


n! 

2 " 


f) 


OO 


E 


cos 7m 
n 


g) 


OO 


E 


(- 1 )” 
nlnn- 1 


Determine whether each the following series converges conditionally, converges absolutely, 
or diverges: 


a) 


OO 


E 

n=l 


(-i) ra 

\fn 


b) 


OO 


E 


(-1)' ! inn 
n 


c) 

~ (-1 ) n n 

n=2 (bl/7) 2 

d) 

oo (_ 1 )n 2 n 

, e" - 1 
«=i e 1 


e) 


g (-1)” 

n= 1 


sin n 
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f) 


g) 


50.7 Hints 


a) take a limit 

b) 

$n — Sn - 1 + dn 


a) sum of an infinite geometric series 

b) sum of an infinite geometric series 

c) telescoping series 

d) rewrite so that all exponents arc n 


a) p-series 

b) geometric series 

c) limit comparison test 

d) direct comparison test 

e) divergence test 

f) alternating series test 

g) alternating series test 


a) alternating series test; direct comparison test or integral test 

b) alternating series test; integral test or direct comparison test 

c) divergence test 

d) alternating series test (optional); limit comparison test with geometric series 

e) divergence test 

f) ratio test 

g) divergence test 


L 


(-1 ) n n\ 
\ (2 n)\ 


“ {-l) n e lln 


n - 1 


arctan n 


50.8 Answers only 

l. 
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Full solutions 


a) The series converges to 2 

b) 


a) 4 

b) 

2 

e-2 

c) 1 

d) &minus;l/5 


a) converges 

b) converges 

c) diverges 

d) diverges 

e) diverges 

f) converges 

g) diverges 


a) converges conditionally 

b) converges conditionally 

c) diverges 

d) converges absolutely 

e) diverges 

f) converges absolutely 

g) diverges 



50.9 Full solutions 


l. 


a) The series converges to 2 since: 


s = lim s n = lim 


< 1 , 

2 =2 

3 11 


b) 


&n — $n Sn— 1 — | 

1 

2 

q n 


1 

CM 

l \ 


Q Yl~ 1 


3 ) 


V 



1 1 

3"-i ~3" 


3 1 

3" ~ 3” 


2 

3” 
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a) The series is 



n 


and so is geometric with first term a - 3 and common ratio r = 1/4. So 

a 3 

s = = = 4. 

1 - r 1-1/4 

2/e _ 2 

1-2/e ~ e — 2 


c) Note that 


OO 


E 



OO 


E 


1 

n(n- 1) 


OO 


E 


' i 
K n- 1 


by PARTIAL FRACTIONS 8 . So 


N—>oo 


lim 

11 

1- - + 

1 1 

+ 

1 1 

+ ...+ 

N—> OO 

2 

,2 ~~ 3, 


,3 _ 4, 



1 

\N --1 


1 1 
N , ' 


All but the first and last terms cancel out, so 


s = lim 
N—*oo 


1 

1 

N 


= 1. 


d) The series simplifies to 


(— 1)”2 


ri'-iii oo 


L v ' 


(- 2 \ 


on. 9 

»=1 ° z 


n= 1 ■ 


and so is geometric with common ratio 

r = -2/3 

and first term 


-1/3 


Thus 


-1/3 


5 = 


1 - (—2/3) 


= —1/5. 


8 http://en.wikibooks.org/wiki/HSE%20Partial%20Fractions 
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Full solutions 


a) This is a /9-series with 

P = 2 

. Since p> 1, the series converges. 

b) This is a geometric series with common ratio r = 1/2, and so converges since 
I r I < 1. 

c) This series can be compared to a p-series: 

“ n °° n _ 1 

hi n 2 + 1 h\ n 2 hi n 


The 


symbol means the two series are "asymptotically equivalent" — that is, 
they either both converge or both diverge because their terms behave so 
similarly when summed as n gets very large. This can be shown by the 
LIMIT COMPARISON TEST 9 : 


lim 

n—* oo 


n 


\n l + 1 n) 


( n n \ n2 

= lim — ^ = lim — = 1 

oo V n A + 1 It n— oo n z + 1 


Since the limit is positive and finite, the two series either both converge 
or both diverge. The simpler series diverges because it is a /9- scries with 


P = 1 

(harmonic series), and so the original series diverges by the limit com- 
parison test. 


d) This series can be compared to a smaller /;- series: 


OO 


L 


i 

In n 


OO 1 

I 1 

n=2 n 


The /;- scries diverges since 

P = 1 

(harmonic series), so the larger series diverges by the appropriate DIRECT 
COMPARISON TEST 10 . 

e) The terms of this series do not have a limit of zero. Note that when 

n > 1 


9 Chapter 47.2.4 on page 267 

10 http://en.wikibooks.org/wiki/Calculus%2FSeries%23Direct%20comparison%20tests 
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n\ n n - 1 n — 2 
2 " ~~ 2 ' ~~ 2 2 


2 

2 


In In 

->-•( 1 )-- = - 
2 2 2 4 


To see why the inequality holds, consider that when 

n = 2 

none of the fractions in the square brackets above are actually there; when 

n = 3 


only 2/2 (which is the same as 

[n — l]/2 


) is in the brackets; when 


only 3/2 (equal to 
) and 2/2 (equal to 
) are there; when 


n = 4 
[n — l]/2 
[n — 2]/2 


n = 5 

, only 4/2, 3/2, and 2/2 are there; and so forth. Clearly none of these 
fractions are less than 1 and they never will be, no matter what 

n > 1 


is used. Thus 


n\ 

lim — = oo 

n — *-oo 2 n 


Therefore the series diverges by the divergence test. 


f) This is an alternating series: 


OO 


I 


cos nn 
n 


oo 


1 


(-1)" 

n 


Since the sequence 

I (hi I = 

n 

decreases to 0, the series converges by the alternating series test. 
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g) Since the terms alternate, consider the sequence 


I Cl)l I 


1 

Inn - 1 


This sequence is clearly decreasing (since both n and In n arc increas- 
ing — one may also show that the derivative [with respect to n] of the 
expression is negative for 

n > 2 

) and has limit zero (the denominator goes to infinity), so the series con- 
verges by the alternating series test. 


a) This series alternates, so consider the sequence 

1 



Since this sequence is clearly decreasing to zero, the original series is 
convergent by the alternating series test. Now, consider the series formed 
by taking the absolute value of the terms of the original series: 


OO 


E = E 

Yl— 1 


1 

yfh 


This new series can be compared to a /7-series: 


E 


72=1 


1 

y/n 



1 

n 


Since the smaller series diverges, the larger one diverges. But this means 
the original (alternating) series was not absolutely convergent. Thus, the 
original series is only conditionally convergent. 


b) solution to come 

c) solution to come 

d) solution to come 

e) solution to come 

f) solution to come 

g) solution to come 
li 

pt:CAlculo (Volume 3)/Sequencias e series: Exercicios 12 


11 http://en.wikibooks.org/wiki/ Category%3ACalculus%20%28book%2 9 

12 http://pt.wikibooks.org/wiki/ C%E1lculo%20%2 8Volume%2 03%2 9%2FSequ%EAnc i as%2 0e% 
20s%E9ries% 3A% 20Exerc%EDcios 
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52 Vectors 


52.1 Two-Dimensional Vectors 

52.1.1 Introduction 

In most mathematics courses up until this point, we deal with scalars. These are quantities which 
only need one number to express. For instance, the amount of gasoline used to drive to the grocery 
store is a scalar quantity because it only needs one number: 2 gallons. 

In this unit, we deal with vectors. A vector is a directed line segment — that is, a line segment that 
points one direction or the other. As such, it has an initial point and a terminal point. The vector 
starts at the initial point and ends at the terminal point, and the vector points towards the terminal 
point. A vector is drawn as a line segment with an arrow at the terminal point: 



Figure 6 1 : A single vector without coordinate axes. 


The same vector can be placed anywhere on the coordinate plane and still be the same vector — 
the only two bits of information a vector represents are the magnitude and the direction. The 
magnitude is simply the length of the vector, and the direction is the angle at which it points. Since 
neither of these specify a starting or ending location , the same vector can be placed anywhere. To 
illustrate, all of the line segments below can be defined as the vector with magnitude 4y/2 and angle 
45 degrees: 
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8 


6 


4 



2 


* l l l l l l l l 

2 4 6 8 

Figure 62: Multiple locations for the same vector. 

It is customary, however, to place the vector with the initial point at the origin as indicated by the 
black vector. This is called the standard position. 

52.1.2 Component Form 

In standard practice, we don’t express vectors by listing the length and the direction. We instead 
use component form, which lists the height (rise) and width (run) of the vectors. It is written as 
follows: 


run' 

rise ; 


y 


5 



X 



5 


Figure 63: Vector with rise and run measurements. 
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Other ways of denoting a vector in component form include: 


(U X ,Uy) 


and 


(Ux,Uy) 


From the diagram we can now see the benefits of the standard position: the two numbers for the 
terminal point’s coordinates are the same numbers for the vector’s rise and run. Note that we named 
this vector u. Just as you can assign numbers to variables in algebra (usually x, y, and z), you can 
assign vectors to variables in calculus. The letters u, v, and w arc usually used, and either boldface 
or an arrow over the letter is used to identify it as a vector. 

When expressing a vector in component form, it is no longer obvious what the magnitude and 
direction arc. Therefore, we have to perform some calculations to find the magnitude and direction. 


52.1.3 Magnitude 


I u | = ^u 2 + u 2 

where u x is the width, or run, of the vector; u y is the height, or rise, of the vector. You should 
recognize this formula as the Pythagorean theorem. It is — the magnitude is the distance between 
the initial point and the terminal point. 

The magnitude of a vector can also be called the norm. 


52.1.4 Direction 


u 


tanfl = — 


u x 
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Figure 64: Vector triangle with ux and uy labeled. 


where 6 is the direction of the vector. This formula is simply the tangent formula for right triangles. 


52.1.5 Vector Operations 

For these definitions, assume: 


U = 


'u x 

My, 


v = 


Wy. 


Vector Addition 

Vector Addition is often called tip-to-tail addition, because this makes it easier to remember. 

The sum of the vectors you are adding is called the resultant vector, and is the vector drawn from the 
initial point (tip) of the first vector to the terminal point (tail) of the second vector. Although they 
look like the arrows, the pointy bit is the tail, not the tip. (Imagine you were walking the direction 
the vector was pointing... you would start at the flat end (tip) and walk toward the pointy end.) 

It looks like this: 
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(Notice, the black lined vector is the sum of the two dotted line vectors!) 

Numerically: 



Or more generally: 


u + v = 


'u x + iM 

Uy+ Vy) 


Scalar Multiplication 

Graphically, multiplying a vector by a scalar changes only the magnitude of the vector by that same 
scalar. That is, multiplying a vector by 2 will "stretch" the vector to twice its original magnitude, 
keeping the direction the same. 


299 



Vectors 



H 1 1 1 h 

2 4 6 


Figure 66: Multiplication of a vector with a scalar 



Numerically, you calculate the resultant vector with this formula: 


cu = 


cu x 

^CUy j 


, where c is a constant scalar. 

As previously stated, the magnitude is changed by the same constant: 


|cu| = c|u| 

Since multiplying a vector by a constant results in a vector in the same direction, we can reason that 
two vectors are parallel if one is a constant multiple of the other — that is, that u||v if u = cv for 
some constant c. 

We can also divide by a non-zero scalar by instead multiplying by the reciprocal, as with dividing 
regular numbers: 


u 

c 


1 

-u, 0 
c 
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Dot Product 

The dot product, sometimes confusingly called the scalar product, of two vectors is given by: 


or which is equivalent to: 


U'V= U X V X + UyVy 


uv = |u||v| cost? 

where 6 is the angle difference between the two vectors. This provides a convenient way of finding 
the angle between two vectors: 


52.1.6 Applications of Scalar Multiplication and Dot Product 
Unit Vectors 

A unit vector is a vector with a magnitude of 1 . The unit vector of u is a vector in the same 
direction as u, but with a magnitude of 1 : 


Figure 67 : Unit vector 



1 2 3 
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The process of finding the unit vector of u is called normalization. As mentioned in scalar mul- 
tiplication. multiplying a vector by constant C will result in the magnitude being multiplied by C. 
We know how to calculate the magnitude of u. We know that dividing a vector by a constant will 
divide the magnitude by that constant. Therefore, if that constant is the magnitude, dividing the 
vector by the magnitude will result in a unit vector in the same direction as u: 


u 



, where 

w 

is the unit vector of 

u 


Standard Unit Vectors 

A special case of Unit Vectors arc the Standard Unit Vectors i and j: i points one unit directly right 
in the x direction, and j points one unit directly up in the y direction: 



j = 


'O' 

, 1 , 


Using the scalar multiplication and vector addition rules, we can then express vectors in a different 
way: 




= xi + yj 


If we work that equation out, it makes sense. Multiplying x by i will result in the vector 


vO, 


Multiplying y by j will result in the vector 


\y. 


Adding these two together will give us our original 


vector, 


It. 


. Expressing vectors using i and j is called standard form. 
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Projection and Decomposition of Vectors 

Sometimes it is necessary to decompose a vector u into two components: one component parallel 
to a vector v, which we will call uy ; and one component perpendicular to it, u . 



Figure 68: Projection of a vector 


Since the length of up is (|u| -cos0), it is straightforward to write down the formulas for and U|| : 


and 


U|| = |u| * 


(U-V) 

(|u||v|) 


* = (u-v)/(|v| 2 )v 

|v| 


U_L — U — U|| 


Length of a vector 

The length of a vector is given by the dot product of a vector with itself, and 6 = 0 deg: 


uu = |u||u|cos0 = |u| 2 


Perpendicular vectors 


If the angle 6 between two vectors is 90 degrees or | (if the two vectors are orthogonal to each 
other), that is the vectors are perpendicular, then the dot product is 0. This provides us with an easy 


way to find a perpendicular vector: if you have a vector u = 
be found by either 


(“')■ 

\Uy) 


a perpendicular vector can easily 
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v = 



52.1.7 Polar coordinates 

Polar coordinates are an alternative two-dimensional coordinate system, which is often useful when 
rotations arc important. Instead of specifying the position along the x and y axes, we specify the 
distance from the origin, r, and the direction, an angle d. 




Figure 69: Polar coordinates 
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Looking at this diagram, we can see that the values of x and y arc related to those of r and 8 by the 
equations 


x = rcos0 r = a/x 2 + y 2 
y=rsin8 tan 0=| 

Because tan -1 is multivalued, care must be taken to select the right value. 

Just as for Cartesian coordinates the unit vectors that point in the x and y directions arc special, so 
in polar coordinates the unit vectors that point in the r and 9 directions arc also special. 

We will call these vectors r and 8, pronounced r-hat and theta-hat. Putting a circumflex over a vector 
this way is often used to mean the unit vector in that direction. 

Again, on looking at the diagram we see, 

i = rcos0-0sin0 f=^i+^j 
j = rsin0 + 0cos0 0 = -^i+|j 

52.2 Three-Dimensional Coordinates and Vectors 

52.2.1 Basic definition 

Two-dimensional Cartesian coordinates as we’ve discussed so far can be easily extended to three- 
dimensions by adding one more value: ’z’. If the standard (x,y) coordinate axes arc drawn on a 
sheet of paper, the ’z’ axis would extend upwards off of the paper. 
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Figure 70: 3D coordinate axes. 


Similar to the two coordinate axes in two-dimensional coordinates, there are three coordinate 
planes in space. These are the xy-plane, the yz-plane, and the xz-plane. Each plane is the "sheet 
of paper" that contains both axes the name mentions. For instance, the yz-plane contains both the y 
and z axes and is perpendicular to the x axis. 
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k = 



Again, both forms (component and standard) are equivalent. 


n 

2 = li + 2j + 3k 

,3 j 

Magnitude: Magnitude in three dimensions is the same as in two dimensions, with the addition of 
a ’z’ term in the radicand. 


u 


U 2 X + Uy + U 


2 

z 


52.3 Three dimensions 

The polar coordinate system is extended into three dimensions with two different coordinate sys- 
tems, the cylindrical and spherical coordinate systems, both of which include two-dimensional or 
planar polar coordinates as a subset. In essence, the cylindrical coordinate system extends polar 
coordinates by adding an additional distance coordinate, while the spherical system instead adds an 
additional angular coordinate. 
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52.3.1 Cylindrical coordinates 



The cylindrical coordinate system is a coordinate system that essentially extends the two- 
dimensional polar coordinate system by adding a third coordinate measuring the height of a point 
above the plane, similar to the way in which the Cartesian coordinate system is extended into three 
dimensions. The third coordinate is usually denoted h, making the three cylindrical coordinates (r, 
, h). 

The three cylindrical coordinates can be converted to Cartesian coordinates by 

x=r cos 6 
y = r sind 
z= h. 
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52.3.2 Spherical coordinates 



Figure 73: A point plotted using spherical coordinates 


Polar coordinates can also be extended into three dimensions using the coordinates (, , ), where 
is the distance from the origin, is the angle from the z-axis (called the colatitude or zenith and 
measured from 0 to 180°) and is the angle from the x-axis (as in the polar coordinates). This 
coordinate system, called the spherical coordinate system, is similar to the latitude and longitude 
system used for Earth, with the origin in the centre of Earth, the latitude being the complement of , 
determined by = 90° , and the longitude l being measured by 1= 1 80°. 

The three spherical coordinates are converted to Cartesian coordinates by 

x = p sirup cos0 
y = p sin0 sin0 
z = p cos <p. 


r = J x 2 + y 2 + z 2 , 


y 

6 =arctan-, 
x 


<p = arccos-, 
r 
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52.3.3 Cross Product 

The cross product of two vectors is a DETERMINANT 1 : 

i j k 

uxv= Ux Uy U z 
Vx Vy V z 

and is also a PSEUDOVECTOR 2 . 

The cross product of two vectors is orthogonal to both vectors. The magnitude of the cross product 
is the product of the magnitude of the vectors and the sin of the angle between them. 

|uxv| = |u||v| sind 

This magnitude is the area of the parallelogram defined by the two vectors. 

The cross product is linear and anticommutative. For any numbers a and b, 

u x (flv + bw) = fluxv+buxw uxv=-vxu 
If both vectors point in the same direction, their cross product is zero. 

52.3.4 Triple Products 

If we have three vectors we can combine them in two ways, a triple scalar product, 

u • (v x w) 

and a triple vector product 

u x (v x w) 

The triple scalar product is a determinant 

Ux Uy U z 
U • (v X w) = V x Vy v z 
W x Wy W z 

1 http ://en. wikipedia.org/wiki/determ inant 

2 http ://en. wikipedia.org/wiki/pseudovector 
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If the three vectors are listed clockwise, looking from the origin, the sign of this product is positive. 
If they are listed anticlockwise the sign is negative. 

The order of the cross and dot products doesn’t matter. 


u • (v x w) = (u x v) • w 

Either way, the absolute value of this product is the volume of the parallelepiped defined by the 
three vectors, u, v, and w 

The triple vector product can be simplified 


u x (v x w) = (u • w)v - (u • v)w 

This form is easier to do calculations with. 

The triple vector product is not associative. 


u x (v x w) ^ (u x v) x w 

There are special cases where the two sides are equal, but in general the brackets matter. They must 
not be omitted. 


52.3.5 Three-Dimensional Lines and Planes 

We will use r to denote the position of a point. 

The multiples of a vector, a all lie on a line through the origin. Adding a constant vector b will shift 
the line, but leave it straight, so the equation of a line is, 

r = as + b 

This is a parametric equation. The position is specified in terms of the parameter 5. 

Any lineal - combination of two vectors, a and b lies on a single plane through the origin, provided 
the two vectors are not colinear. We can shift this plane by a constant vector again and write 


r = as + bf + c 

If we choose a and b to be orthonormal vectors in the plane (i.e. unit vectors at right angles) then 
5 and t are Cartesian coordinates for points in the plane. 

These parametric equations can be extended to higher dimensions. 

Instead of giving parametric equations for the line and plane, we could use constraints. E.g., for any 
point in the xy plane z -0 
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For a plane through the origin, the single vector normal to the plane, n, is at right angle with every 
vector in the plane, by definition, so 


r -n = 0 

is a plane through the origin, normal to n. 

For planes not through the origin we get 

(r - a) • n = 0 r • n = a 

A line lies on the intersection of two planes, so it must obey the constraint for both planes, i.e. 

rn = n rm = b 

These constraint equations con also be extended to higher dimensions. 


52.4 Vector- Valued Functions 


Vector- Valued Functions are functions that instead of giving a resultant scalar value, give a resultant 
vector value. These aid in the create of direction and vector fields, and arc therefore used in physics 
to aid with visualizations of electric, magnetic, and many other fields. They arc of the following 
form: 


F(t) = 


r ai(t)' 

a 2 (t) 

a 3 (t) 


Va n (t); 


52.4.1 Introduction 


52.4.2 Limits, Derivatives, and Integrals 


Put simply, the limit of a vector-valued function is the limit of its parts. 
Proof: 


Suppose lim f ^ c F(f) = L = 


ar 

a 2 

a 3 


V a n/ 

Therefore for any e > 0 there is a cj) > 0 such that 


313 



Vectors 


0< \ t-c\ <(p => |F(f)-L| <e 

But by the triangle inequality |«i| < |F| < |«i| + |a 2 | + |a 3 | + ... + \a n \ |ai(f) - a\\ < |F(f) -L| 
So 


0< \ t-c\ <(p => \a\{t)-ai\<e 


Therefore 1 i m / _ r . a, (?) - a, A similar argument can be used through parts a_n(t) 


Now let lim r ^ f F(t) = L = 


a 3 


again, and that for any e>0 there is a corresponding (p>0 such 


O<lt-cl<0 implies 


Van ; 


I a„{t)-a n \ < | 
Then 


0 < | f — c| < 0 ==> |F(f)-L|<| + ... + ^=e 
therefore ! : 



fa, 'l 


Tim f _, c ai (t )' 1 


a 2 


lim f ^ c a 2 (t) 

lim f ^ c F(t) = L = 

a 3 

= 

lim f ^ c a 3 (t) 




Jim^ c a„(t) ; 


From this we can then create an accurate definition of a derivative of a vector-valued function: 


F'(f) = lim^o 


F(t+/;)-F(f) 

h 


r ai (t ) 1 
a 2 (t) 
a 3 (t) 


V a n (t)/ 


r ai (t + h )' 1 


r ai(t)' 

a 2 (t + h) 


a 2 (t) 

a 3 (t + h) 

- 

a 3 (t) 

,a n (t + h)j 


,a n (t) ; 


- lim - 

h — *0 h 
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BS 


ai(t+h)-ai(t) \ 


lim _ a 2 (t+h)-a2(t) 

lim/j 


0 h 

a 3 (t+h)-a 2 (t) 




The final step was accomplished by taking what we just did with limits. 

By the Fundamental Theorem of Calculus integrals can be applied to the vector’s components. 

In other words: the limit of a vector function is the limit of its parts, the derivative of a vector 
function is the derivative of its parts, and the integration of a vector function is the integration of it 
parts. 


52.4.3 Velocity, Acceleration, Curvature, and a brief mention of the Binormal 


Assume we have a vector- valued function which starts at the origin and as its independent variables 
changes the points that the vectors point at trace a path. 

We will call this vector r(f), which is commonly known as the position vector. 

If r then represents a position and t represents time, then in model with Physics we know the 
following: 

r(f+ h) -r(f) is displacement. r'(f) = v(f) where v(f) is the velocity vector. |v(f)| is the speed. 
r"(f) = v' ( t) = a(f) where a( f) is the acceleration vector. 


The only other vector that comes in use at times is known as the curvature vector. 

The vector T(f) used to find it is known as the unit tangent vector, which is defined as j^jj or 
shorthand v. 

T r ( 

The vector normal N to this then is |^j- 
We can verify this by taking the dot product 


T-N = 0 

Also note that |v(t)| = ^ 
and 


ds 


V 

dr 

dt 

_ dr 

~ Id “ 

ds 

~ ds 

dt 


T'(f) _ 

dT 


dt 

- tLL 

|v(f)| 

ds 

dt 

ds 


and 

N = 

Then we can actually verify: 

E<T-T> = A<1> 

d T fr . fr d T /-> 

ds ' 1 + 1 ' ds - u 
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2 * T 'f =° 

T 'f = 0 

T-N = 0 

Therefore N is perpendicular to T 

dr 

What this gives rise to is the Unit Normal Vector -ff- of which the top-most vector is the Normal 

Irfjl 

vector, but the bottom half (l^l) -1 is known as the curvature. Since the Normal vector points 
toward the inside of a curve, the sharper a turn, the Normal vector has a large magnitude, therefore 
the curvature has a small value, and is used as an index in civil engineering to reflect the sharpness 
of a curve (clover-leaf highways, for instance). 

The only other thing not mentioned is the Binormal that occurs in 3-d curves T x N = B, which is 
useful in creating planes parallel to the curve. 
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53 Lines and Planes in Space 


53.1 Introduction 

For many practical applications, for example for describing forces in physics and mechanics, you 
have to work with the mathematical descriptions of lines and planes in 3-dimensional space. 


53.2 Parametric Equations 

53.2.1 Line in Space 

A line in space is defined by two points in space, which I will call Pi and P 2 . Let Xi be the vector 
from the origin to Pi, and x 2 the vector from the origin to P 2 . Given these two points, every other 
point P on the line can be reached by 


x = Xi + Aa 


where a is the vector from Pi and P 2 : 


a = x 2 -Xi 



Figure 74: Line in 3D Space. 
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53.2.2 Plane in Space 

The same idea can be used to describe a plane in 3-dimensional space, which is uniquely defined by 
three points (which do not lie on a line) in space ( P\ , P-zP?,). Let x/ be the vectors from the origin 
to Pj. Then 


x = xi + Aa + pb 


with: 


a = X2 - Xi and b = X3 - xi 


Note that the starting point does not have to be xi, but can be any point in the plane. Similarly, 
the only requirement on the vectors a and b is that they have to be two non-collinear vectors in our 
plane. 


53.3 Vector Equation (of a Plane in Space, or of a Line in a Plane) 

An alternative representation of a Plane in Space is obtained by observing that a plane is defined by 
a point Pi in that plane and a direction perpendicular to the plane, which we denote with the vector 
n. As above, let xi describe the vector from the origin to Pi , and x the vector from the origin to 
another point P in the plane. Since any vector that lies in the plane is perpendicular to n, the vector 
equation of the plane is given by 


n- (x-xi) = 0 


In 2 dimensions, the same equation uniquely describes a Line. 


53.4 Scalar Equation (of a Plane in Space, or of a Line in a Plane) 

If we express n and x through their components 



f a ] 


( X 

n = 

b 

, andx = 

y 


, C j 


< Z ; 


writing out the scalar product for n- (x-xi) = 0 provides us with the scalar equation for a plane in 
space: 
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ax + by + cz = d 


where d = n • xj . 

In 2d space, the equivalent steps lead to the scalar equation for a line in a plane: 


ax+by = c 


l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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54 Multivariable & Differential Calculus 


In your previous study of calculus, we have looked at functions and their behavior. Most of these 
functions we have examined have been all in the form 

fix) : R — R. 

and only occasional examination of functions of two variables. However, the study of functions of 
several variables is quite rich in itself, and has applications in several fields. 

We write functions of vectors - many variables - as follows: 

/: R m - R n 

and fix) for the function that maps a vector in R m to a vector in R n . 

Before we can do calculus in R n , we must f ami liarize ourselves with the structure of R n . We need 
to know which properties of R can be extended to R n 


54.1 Topology in R n 


We arc already familial - with the nature of the regular real number line, which is the set R, and the 
two-dimensional plane, R 2 . This examination of topology in R n attempts to look at a generalization 
of the nature of //-dimensional spaces - R, or R 23 , or R n . 


54. 1 . 1 Lengths and distances 

If we have a vector in R 2 , we can calculate its length using the Pythagorean theorem. For instance, 
the length of the vector (2, 3) is 


= V\3 

We can generalize this to R n . We define a vector’s length, written Ixl, as the square root of the sum 
of the squares of each of its components. That is, if we have a vector x=(xi,...,Xn), 


|x| = \J x\ + x\ H + X^ 

Now that we have established some concept of length, we can establish the distance between two 
vectors. We define this distance to be the length of the two vectors’ difference. We write this 
distance d(x, y ), and it is 
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d(x,y) = |x-y| = \JyJxi ~ yd 2 

This distance function is sometimes referred to as a metric. Other metrics arise in different circum- 
stances. The metric we have just defined is known as the Euclidean metric. 


54.1.2 Open and closed balls 

In R, we have the concept of an interval, in that we choose a certain number of other points about 
some central point. For example, the interval [-1, 1] is centered about the point 0, and includes 
points to the left and right of zero. 

In R 2 and up, the idea is a little more difficult to carry on. For R 2 , we need to consider points to the 
left, right, above, and below a certain point. This may be fine, but for R 3 we need to include points 
in more directions. 

We generalize the idea of the interval by considering all the points that arc a given, fixed distance 
from a certain point - now we know how to calculate distances in R n , we can make our generalization 
as follows, by introducing the concept of an open ball and a closed ball respectively, which arc 
analogous to the open and closed interval respectively. 

an open ball 


B (a, r) 


is a set in the form { x &isin; R^ldfx, a) < r} 
a closed ball 


B{ a, r) 


is a set in the form { x &isin; R"ld(x, a) < r} 

In R, we have seen that the open ball is simply an open interval centered about the point x=a. In R 2 
this is a circle with no boundary, and in R 3 it is a sphere with no outer surface. ( What would the 
closed ball be?) 

if every point in it is within a finite distance of the origin, i.e there exists some />() such that x is in 
S implies Ixkr. 


54.2 Curves and parameterizations 

If we have a function/: R -*■ R n , we say that/s image (the set {//) I r&isin; R} - or some subset 
of R) is a curve in R" and /is its parametrization. 
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Parameterizations arc not necessarily unique - for example, /(f) = (cos f, sin f) such that f &isin; [0, 
2 n) is one parametrization of the unit circle, and g(f) = (cos at, sin at) such that t &isin; [0, 2 n/a) is 
a whole family of parameterizations of that circle. 


54.2. 1 Collision and intersection points 

Say we have two different curves. It may be important to consider 

• points the two curves share - where they intersect 

• intersections which occur for the same value oft - where they collide. 


Intersection points 

Firstly, we have two parameterizations /(f) and g(f), and we want to find out when they intersect, 
this means that we want to know when the function values of each parametrization are the same. 
This means that we need to solve 

fit) = g(s) 

because we’re seeking the function values independent of the times they intersect. 

For example, if we have /(f) = (f, 3f) and g(f) = (f, f 2 ), and we want to find intersection points: 

fit) = g(s) 

(f, 3f) = (s, s 2 ) 
t = s and 3 f = s 2 

with solutions (f, s) = (0, 0) and (3, 3) 

So, the two curves intersect at the points (0, 0) and (3, 3). 


Collision points 

However, if we want to know when the points "collide", with fit) and g(f), we need to know when 
both the function values and the times are the same, so we need to solve instead 

fit) = git) 

For example, using the same functions as before, fit) = (f, 3f) and g(f) = (f, t 2 ), and we want to find 
collision points: 

fit) = git) 

(f, 3f) = (f, t 2 ) 
t= t and 3 f = f 

which gives solutions f = 0, 3 So the collision points are (0, 0) and (3, 3). 

We may want to do this to actually model physical problems, such as in ballistics. 
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54.2.2 Continuity and differentiability 

If we have a parametrization/: R — R'\ which is built up out of component functions in the form 
ft) = (f(t),...,f n ( t)), f is continuous if and only if each component function is also. 

In this case the derivative of ft) is 

a, = (/] &prime;(f),...,/„&prime;(Q). This is actually a specific consequence of a more 
general fact we will see later. 

54.2.3 Tangent vectors 

Recall in single-variable calculus that on a curve, at a certain point, we can draw a line that is tangent 
to that curve at exactly at that point. This line is called a tangent. In the several variable case, we 
can do something si mi lar. 

We can expect the tangent vector to depend on /&prime;(f) and we know that a line is its own 
tangent, so looking at a parametrised line will show us precisely how to define the tangent vector 
for a curve. 

An arbitrary line is f( f)=a/+b. with :f( f)=Oj t+b,. so 
/j&primc;( f)=Oj and 

/&primc;( f)=a, which is the direction of the line, its tangent vector. 

Similarly, for any curve, the tangent vector is /&primc;( t). 
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Angle between curves 



Figure 75 


We can then formulate the concept of the angle between two curves by considering the angle be- 
tween the two tangent vectors. If two curves, parametrized by f\ and fi intersect at some point, 
which means that 

f(s)=f 2 (t)=c, 

the angle between these two curves at c is the angle between the tangent vectors / &prime;(s) and 
/ 2 &pmuc;(f) is given by 


arccos 


ir i U)||f' 2 (r)| 
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Tangent lines 

With the concept of the tangent vector as being analogous to being the gradient of the line in the 
one variable case, we can form the idea of the tangent line. Recall that we need a point on the line 
and its direction. 

If we want to form the tangent line to a point on the curve, say p, we have the direction of the line 
/&primc;( p), so we can form the tangent line 

x(t)=p+tj&pnme;(p) 


54.2.4 Different parameterizations 

One such parametrization of a curve is not necessarily unique. Curves can have several different 
parametrizations. For example, we already saw that the unit circle can be parametrized by g(t) = 
(cos at, sin at) such that r&isin; [0, 2n/a). 

Generally, if /is one parametrization of a curve, and gis another, with 
/Zb) = gOo) 

there is a function u(t) such that u( to)=so, and g(u(t)) -f(t) near to- 

This means, in a sense, the function u(t) "speeds up" the curve, but keeps the curve’s shape. 

54.2.5 Surfaces 

A surface in space can be described by the image of a function / : R 2 — R n . /is said to be the 
parametrization of that surface. 

For example, consider the function 

/a, p) = a(2,l,3)+jS(- 1,2,0) 

This describes an infinite plane in R 3 . If we restrict a and /3 to some domain, we get a parallelogram- 
shaped surface in R 3 . 

Surfaces can also be described explicitly, as the graph of a function z - fix, y) which has a standard 
parametrization as fix,y)=(x, y, fix,y)), or implictly, in the form fix, y, z)=c. 


Level sets 

The concept of the level set (or contour) is an important one. If you have a function fix, y, z), a level 
set in R 3 is a set of the form { ( x,y,z)\fi x,y,z)=c) . Each of these level sets is a surface. 

Level sets can be similarly defined in any R n 

Level sets in two dimensions may be familial - from maps, or weather charts. Each line represents 
a level set. Lor example, on a map, each contour represents all the points where the height is the 
same. On a weather chart, the contours represent all the points where the air pressure is the same. 
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54.3 Limits and continuity 

Before we can look at derivatives of multivariate functions, we need to look at how limits work with 
functions of several variables first, just like in the single variable case. 

If we have a function /: R” 7 — R", we say that fix) approaches b (in R") as x approaches a (in R” 7 ) 
if, for all positive e. there is a corresponding positive number 8, \fix)-b\ < e whenever lx- < 2 1 < 8, with 
xfi a. 

This means that by making the difference between x and a smaller, we can make the difference 
between fix) and b as small as we want. 

If the above is true, we say 

• fix) has limit bat a 

• lim x _ a f(x) = b 

• fix) approaches b as x approaches a 

• fix) — ■ b as x — a 

These four statements are all equivalent. 


54.3.1 Rules 

Since this is an almost identical formulation of limits in the single variable case, many of the limit 
rules in the one variable case arc the same as in the multivariate case. 

For / and g, mapping R” 7 to R 77 , and h(x) a scalar function mapping R 777 to R, with 

• fix) — b as x — a’ 

• g(x) — c as x — a’ 

• h(x) — H as x — a 

then: 

• lim x _ a (f+g) =b + c 

• lim x _ a (/if) = Hb 

and consequently 

• lim x _ a (f-g) = b-c 

• lim x _a(f x g) = bxc 

when H^O 

• lim x ^ a (£) = jj 

54.3.2 Continuity 

Again, we can use a similar definition to the one variable case to formulate a definition of continuity 
for multiple variables. 

If/: R m - R", f is continuous at a point a in R' 77 if fid) is defined and 
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limf(x) = f(a) 

x^a 

Just as for functions of one dimension, if fig arc both continuous at p, f+g, Xf (for a scalar A), fig, 
and /&timcs;garc continuous also. If cp : R w — ► R is continus at p, <pf jhp are too if <p is never zero. 

From these facts we also have that if A is some matrix which is n&times;m in size, with x in R"\ 
a function fix)=A x is continuous in that the function can be expanded in the form x\a\+...+x m a m , 
which can be easily verified from the points above. 

If f : R " 1 — R" which is in the form fix) = ( f\ (x),...,f n (x) is continuous if and only if each of its 
component functions arc a polynomial or rational function, whenever they arc defined. 

Finally, if /is continuous at p, gis continuous at fip), gffix)) is continuous at p. 

54.3.3 Special note about limits 

It is important to note that we can approach a point in more than one direction, and thus, the 
direction that we approach that point counts in our evaluation of the limit. It may be the case that a 
limit may exist moving in one direction, but not in another. 


54.4 Differentiable functions 


We will start from the one-variable definition of the derivative at a point /;. namely 


lim 

x^p 


fM- f [p) 
x- p 


= fip) 


Let’s change above to equivalent form of 


Um f[x)-f(p)-f{p)(x-p) = Q 
X^p x-p 

which achieved after pulling f’(p) inside and putting it over a common denominator. 

We can’t divide by vectors, so this definition can’t be immediately extended to the multiple variable 
case. Nonetheless, we don’t have to: the thing we took interest in was the quotient of two small 
distances (magnitudes), not their other properties (like sign). It’s worth noting that ’other' property 
of vector neglected is its direction. Now we can divide by the absolute value of a vector, so lets 
rewrite this definition in terms of absolute values 


,. \f(x)-f(p)-f{p)(x-p)\ 

lim i j = 0 

X^P \ X ~P\ 

Another form of formula above is obtained by letting h = x- p we have x = p + h and if x — p, the 
h - x- p — 0, so 
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\f(p+h)~ f{p)~ f'{p)h\ 

lim m 

o \h\ 


= 0 


where h can be thought of as a 'small change’. 

So, how can we use this for the several-variable case? 

If we switch all the variables over to vectors and replace the constant (which performs a linear map 
in one dimension) with a matrix (which denotes also a linear map), we have 

|f(x) - f (p) - A(x - p) | 

lim = 0 

x *p |x — p| 


or 


,. |f(p + h)-f(p) -Ah| 

lim = 0 

h^O |h| 

If this limit exists for some /: R"' — » R", and there is a linear map A : R m — ► R” (denoted by matrix 
A which is m&times;n), we refer to this map as being the derivative and we write it as D p f. 

A point on terminology - in referring to the action of taking the derivative (giving the linear map A), 
we write D p f but in referring to the matrix A itself, it is known as the Jacobian matrix and is also 
written J p f. More on the Jacobian later. 


54.4.1 Properties 

There are a number of important properties of this formulation of the derivative. 


Affine approximations 

If /is differentiable at p for x close to p, \fix)-(fi p)+A(x-p))\ is small compared to lx- pi, which means 
that fix) is approximately equal to J[p)+A(x-p). 

We call an expression of the form g(x)+c affine, when g(x) is linear and cis a constant. J[p)+A(x-p) 
is an affine approximation to fix). 


Jacobian matrix and partial derivatives 

The Jacobian matrix of a function is in the form 


Mil 


dfi_ 

dxj 


p 
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for a/: R m — R' ! , J p fis an&t imespn ma trix. 

The consequence of this is that if /is differentiable at p, all the partial derivatives of /exist at p. 

However, it is possible that all the partial derivatives of a function exist at some point yet that 
function is not differentiable there, so it’s very important not to mix derivative (linear map) with the 
Jacobian (matrix) especially when cited situation arose. 


Continuity and differentiability 

Furthermore, if all the partial derivatives exist, and arc continuous in some neighbourhood of a 
point p, then / is differentiable at p. This has the consequence that for a function / which has 
its component functions built from continuous functions (such as rational functions, differentiable 
functions or otherwise), /is differentiable everywhere /is defined. 

We use the terminology continuously differentiable for a function differentiable at p which has all 
its partial derivatives existing and are continuous in some neighbourhood at p. 


54.4.2 Rules of taking Jacobians 

If/: R m -*■ R'\ and h(x) : R m — R arc differentiable at ’p’: 

* /p(f+g) = /pf+/ P g 

* / p (/if) = /i/pf+f(p)/ p /t 

* /p(f'g) =g r /pf+f r /pg 

Important: make sure the order is right - matrix multiplication is not commutative! 


Chain rule 

The chain rule for functions of several variables is as follows. For /: R” 1 — ► R" and g : R" — R p , 
and g o /differentiable at p, then the Jacobian is given by 

(%»g)(/pf) 

Again, we have matrix multiplication, so one must preserve this exact order. Compositions in one 
order may be defined, but not necessarily in the other way. 


54.4.3 Alternate notations 


For simplicity, we will often use various standard abbreviations, so we can write most of the formu- 
lae on one line. This can make it easier to see the important details. 

We can abbreviate partial differentials with a subscript, e.g., 


d X h{x, y) 


dh 

dx 


d x d y h = d y d x h 
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When we are using a subscript this way we will generally use the Heaviside D rather than &part;, 

dh 

D x h{x,y ) = — D x D y h = D y D x h 

Mostly, to make the formulae even more compact, we will put the subscript on the function itself. 

D x h = h x h X y = liyx 

If we are using subscripts to label the axes, xi, X2 &hellip;, then, rather than having two layers of 
subscripts, we will use the number as the subscript. 


hi = Dih = dih = d Xl h= - — 

ox 1 

We can also use subscripts for the components of a vector function, u =(u x , u y , u v ) or 
u =(ui, u^&hellip ; u„) 

If we are using subscripts for both the components of a vector and for partial derivatives we will 
separate them with a comma. 


_ du x 

u x,y ~ % 

dy 

The most widely used notation is h x . Both hi and &part;ih are also quite widely used whenever the 
axes are numbered. The notation &part; x h is used least frequently. 

We will use whichever notation best suits the equation we are working with. 


54.4.4 Directional derivatives 

Normally, a partial derivative of a function with respect to one of its variables, say, Xj, takes the 
derivative of that "slice" of that function parallel to the x/th axis. 

More precisely, we can think of cutting a function flx\ ,...,x„) in space along the x/th axis, with 
keeping everything but the xj variable constant. 

From the definition, we have the partial derivative at a point p of the function along this slice as 

df , ftp + fed- f(p) 

= lim 

OXj f— 0 t 


provided this limit exists. 

Instead of the basis vector, which corresponds to taking the derivative along that axis, we can pick 
a vector in any direction (which we usually take as being a unit vector), and we take the directional 
derivative of a function as 
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3f f(p + td) - f(p) 

— = lim 

3d t—o t 

where d is the direction vector. 

If we want to calculate directional derivatives, calculating them from the limit definition is rather 
painful, but, we have the following: if/: R<Sup>n</sup> — • R is differentiable at a point p, lpl=l, 

3f 

aa = Dpf(d) 

There is a closely related formulation which we’ll look at in the next section. 


54.4.5 Gradient vectors 

The partial derivatives of a scalar tell us how much it changes if we move along one of the axes. 
What if we move in a different direction? 

We’ll call the scalar / and consider what happens if we move an infintesimal direction 
dr=(dx,dy,dz), using the chain rule. 


df df df 
df = dx— + dy— + dz— 
dx ' dy dz 


This is the dot product of dr with a vector whose components are the partial derivatives of / called 
the gradient of / 

gradf=Vf=(fS, ... ,S») 

We can form directional derivatives at a point p, in the direction d then by taking the dot product of 
the gradient with d 


3f(p) 

3d 


d Vf(p) 


Notice that grad /looks like a vector multiplied by a scalar. This particular combination of partial 
derivatives is commonplace, so we abbreviate it to 

_ 3 3 3 1 

,3x’ dy’ dz) 

We can write the action of taking the gradient vector by writing this as an operator. Recall that in 
the one-variable case we can write dJdx for the action of taking the derivative with respect to x. This 
case is similar, but &nabla; acts like a vector. 

We can also write the action of taking the gradient vector as: 

( 3 3 3 1 

I ,3xi’3x2 I dx n ) 
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Properties of the gradient vector 
Geometry 


• Grad j{p) is a vector pointing in the direction of steepest slope of f. Igrad//;)l is the rate of change 
of that slope at that point. 

For example, if we consider h(x, j/)=jr+jr. The level sets of h are concentric circles, centred on the 
origin, and 


Vh = {h x> hy) = 2(x, y ) = 2r 

grad h points directly away from the origin, at right angles to the contours. 

• Along a level set, (&nabla \j){p) is perpendicular to the level set { jd J{x)=J[p) at x=p}. 

If dr points along the contours of / where the function is constant, then df will be zero. Since df is a 
dot product, that means that the two vectors, df and grad f must be at right angles, i.e. the gradient 
is at right angles to the contours. 


Algebraic properties 

Like d/dx, &nabla; is linear. For any pair of constants, a and b, and any pair of scalar functions, / 
and g 


d 

dx 


(af+bg) ‘ a ic f+b ic g 


V{af+bg ) = aVf+bVg 


Since it’s a vector, we can tty taking its dot and cross product with other vectors, and with itself. 


54.4.6 Divergence 


If the vector function u maps R n to itself, then we can take the dot product of u and &nabla;. This 
dot product is called the divergence. 


div u = V • u = 


dui du,2 
dxi dx2 


du n 

dx„ 


If we look at a vector function like v=(l+x 2 ,xy) we can see that to the left of the origin all the v 
vectors are converging towards the origin, but on the right they arc diverging away from it. 

Div u tells us how much u is converging or diverging. It is positive when the vector is diverging 
from some point, and negative when the vector is converging on that point. 

Example: 

For u=(l+x 2 , xy), div v=3x, which is positive to the right of the origin, where v is 
diverging, and negative to the left of the origin, where v is converging. 

Like grad, div is linear. 
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V • (au + bv ) = a V • u + bS7 ■ v 

Later in this chapter we will see how the divergence of a vector function can be integrated to tell us 
more about the behaviour of that function. 

To find the divergence we took the dot product of &nabla; and a vector with &nabla; on the left. If 
we reverse the order we get 


U • V = U X D X + UyDy + u z D z 

To see what this means consider i&nabla; This is D x , the partial differential in the i direction. 
Similarly, u&nabla; is the partial differential in the u direction, multiplied by lul 


54.4.7 Curl 

If u is a three-dimensional vector function on R 3 then we can take its cross product with &nabla;. 
This cross product is called the curl. 


curl u = V x u = 


i j k 

D x Dy D z 

U X Uy U z 


Curl u tells us if the vector u is rotating round a point. The direction of curl u is the axis of rotation. 

We can treat vectors in two dimensions as a special case of three dimensions, with u z = 0 and D z u=0. 
We can then extend the definition of curl u to two-dimensional vectors 


curl u = D y u x - D x u y 

This two dimensional curl is a scalar. In four, or more, dimensions there is no vector equivalent to 
the curl. 

Example: 


Consider u=( -y, x). These vectors arc tangent to circles centred on the origin, so appeal - to be rotating 
around it anticlockwise. 


curl u = D y {-y) - D x x = -2 


Example 

Consider u=( -y, x-z, y), which is si mi lar to the previous example. 
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i j k 

curl u = D x D y D z = 2i + 2k 

-y x-z y 

This u is rotating round the axis i+k 

Later in this chapter we will see how the curl of a vector function can be integrated to tell us more 
about the behaviour of that function. 

54.4.8 Product and chain rules 

Just as with ordinary differentiation, there are product rules for grad, div and curl. 

• If g is a scalar and visa vector, then 

the divergence of gv is 

V-(gv) = gV-v+ (v- V)g 

the curl of gv is 

V x (gv) = g(V x v) + (Vg) x v 

• If u and v are both vectors then 

the gradient of their dot product is 

V(u-v) = u x (V x v) +vx (V x u) + (u- V)v+ (v- V)u 
the divergence of their cross product is 

V • (u x v) = v- (V x u) - u • (V x v) 
the curl of their cross product is 

V x (ux v) = (v- V)u- (u- V)v + u(V-v) -v(V-u) 

We can also write chain rules. In the general case, when both functions are vectors and the compo- 
sition is defined, we can use the Jacobian defined earlier. 

Vu(v)| r = J v Vv| r 
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where J u is the Jacobian of u at the point v. 

Normally J is a matrix but if either the range or the domain of u is R 1 then it becomes a vector. In 
these special cases we can compactly write the chain rule using only vector notation. 

• If g is a scalar function of a vector and h is a scalar function of g then 

, dh 
Vft(g) - -r-Vg 
dg 

• If g is a scalar function of a vector then 


d 

V = (Vg) — 
dg 

This substitution can be made in any of the equations containing &nabla; 


54.4.9 Second order differentials 

We can also consider dot and cross products of &nabla; with itself, whenever they can be defined. 
Once we know how to simplify products of two &nabla;’s we’ll know out to simplify products with 
three or more. 

The divergence of the gradient of a scalar /is 


d 2 f , d 2 f 


d 2 f 


V f(.X\ , X 2> . . . X n ) — — y y + — + — 2 
0X7 OX f 0X7, 


This combination of derivatives is the Laplacian of / It is commmonplace in physics and multidi- 
mensional calculus because of its simplicity and symmetry. 

We can also take the Laplacian of a vector, 


2 d 2 u d 2 u 0" u 

V u(Xi,X 2 ,...X„) = ■— -J + -y +... + ■— y 
oxf ox f oxf, 


d 2 - 


The Laplacian of a vector is not the same as the divergence of its gradient 


V(V-u) — V 2 u = V x(Vxu) 

Both the curl of the gradient and the divergence of the curl arc always zero. 


V x V/ = 0 V • (V x u) = 0 


This pair of rules will prove useful. 
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54.5 Integration 

We have already considered differentiation of functions of more than one variable, which leads us 
to consider how we can meaningfully look at integration. 

In the single variable case, we interpret the definite integral of a function to mean the area under the 
function. There is a similar interpretation in the multiple variable case: for example, if we have a 
paraboloid in R 3 , we may want to look at the integral of that paraboloid over some region of the xy 
plane, which will be the volume under that curve and inside that region. 


54.5.1 Riemannsums 

When looking at these forms of integrals, we look at the Riemann sum. Recall in the one-variable 
case we divide the interval we arc integrating over into rectangles and summing the areas of these 
rectangles as their widths get smaller and smaller. For the multiple -variable case, we need to do 
something similar, but the problem arises how to split up R 2 , or R 3 , for instance. 

To do this, we extend the concept of the interval, and consider what we call a n-interval. An 
//-interval is a set of points in some rectangular region with sides of some fixed width in each 
dimension, that is, a set in the form {x&isin;R”laj < Xj < b, with / = (),...,«}, and its arca/sizc/volumc 
(which we simply call its measure to avoid confusion) is the product of the lengths of all its sides. 

So, an 72-interval in R 2 could be some rectangular partition of the plane, such as ( ( x,y) I x &isin; 
[0,1] and y&isin; [0, 2]l } . Its measure is 2. 

If we arc to consider the Riemann sum now in terms of sub-n-intervals of a region £}, it is 


Y /(**) m{S i) 

i;S jc(l 

where m( S,~) is the measure of the division of £2 into k sub-n-intervals Si, and x* ,■ is a point in S,\ 
The index is important - we only perform the sum where S,- falls completely within £2 - any S\ that 
is not completely contained in £2 we ignore. 

As we take the limit as k goes to infinity, that is, we divide up £1 into finer and finer sub- //-intervals, 
and this sum is the same no matter how we divide up £2, we get the integral of / over £1 which we 
write 



For two dimensions, we may write 



and likewise for n dimensions. 
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54.5.2 Iterated integrals 

Thankfully, we need not always work with Riemann sums every time we want to calculate an 
integral in more than one variable. There arc some results that make life a bit easier for us. 

For R 2 , if we have some region bounded between two functions of the other variable (so two func- 
tions in the form flx) = y, or fly) = x), between a constant boundary (so, between x = a and x=b or 
y = a and y = b), we have 


a 


gix) 

f(x) 


h{x, y ) dydx 


An important theorem (called Fubini’s theorem ) assures us that this integral is the same as 



54.5.3 Order of integration 

In some cases the first integral of the entire iterated integral is difficult or impossible to solve, 
therefore, it can be to our advantage to change the order of integration. 

fa f fS h{x ’y ] dxd y 

fc fe{y) h ^y^ d y dx 

As of the writing of this, there is no set method to change an order of integration from dxdy to dydx 
or some other variable. Although, it is possible to change the order of integration in an x and y 
simple integration by simply switching the limits of integration around also, in non-simple x and y 
integrations the best method as of yet is to recreate the limits of the integration from the graph of 
the limits of integration. 

In higher order integration that can’t be graphed, the process can be very tedious. For example, 
dxdydz can be written into dzdydx, but first dxdydz must be switched to dydxdz and then to dydzdx 
and then to dzdydx (but since 3-dimensional cases can be graphed, doing this would be seemingly 
idiotic). 


54.5.4 Parametric integrals 

If we have a vector function, u, of a scalar parameter, s, we can integrate with respect to s simply 
by integrating each component of u separately. 


v(s) 


= J u {s) ds => Vj{s) = J u, (s)ds 


Similarly, if u is given a function of vector of parameters, s, lying in R”, integration with respect to 
the parameters reduces to a multiple integral of each component. 
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54.5.5 Line integrals 



In one dimension, saying we are integrating from a to b uniquely specifies the integral. 

In higher dimensions, saying we are integrating from a to b is not sufficient. In general, we must 
also specify the path taken between a and b. 

We can then write the integrand as a function of the arclength along the curve, and integrate by 
components. 

E.g., given a scalar function h( r) we write 
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f h(r)dr= f h(r)^-ds = f h(r(s))t(s) ds 
Jc Jc ds Jc 

where C is the curve being integrated along, and t is the unit vector tangent to the curve. 
There are some particularly natural ways to integrate a vector function, u, along a curve, 


I u ds I u dr I uxdr I u ncfs 

Jc Jc Jc Jc 


where the third possibility only applies in 3 dimensions. 

Again, these integrals can all be written as integrals with respect to the arclength, s. 


I u dr= I u-tds / uxdr= I uxtds 

Jc Jc Jc Jc 


If the curve is planar and u a vector being in the same plane, the second integral can be usefully 
rewritten. Say, 


u= w f t+ u n n + Ub b 

where t, n, and b arc the tangent, normal, and binormal vectors uniquely defined by the curve. 
Then 


u x t = -bu n + niib 

For the 2-d curves specified b is the constant unit vector normal to their plane, and Ub is always 
zero. 

Therefore, for such curves, 


I u x dr = I u- n ds 

Jc Jc 
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Green’s Theorem 



Figure 77 


Let C be a piecewise smooth, simple closed curve that bounds a region S on the Cartesian plane. If 
two function M(x,y) and N(x,y) are continuous and their partial derivatives are continuous, then 

ffstiE - "> dA = §c Mdx + Nd y = $c F ' dr 

In order for Green’s theorem to work there must be no singularities in the vector field within the 
boundaries of the curve. 

Green’s theorem works by summing the circulation in each infinitesimal segment of area enclosed 
within the curve. 


Inverting differentials 

We can use line integrals to calculate functions with specified divergence, gradient, or curl. 
• If grad V= u 
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u- dr + h{ p) 


where h is any function of zero gradient and curl u must be zero. 
• If div u = V 



V dr + w(p) 


where w is any function of zero divergence. 
• If curl u = v 



dr + w(p) 


where w is any function of zero curl. 
For example, if V=r then 


grad V = 2(x, y, z ) = 2r 


and 


fg 2u • d u 


fg2{udu+ vd v + wd w) 

[“ 2 ]o+Ho + Mo 

? 9 2 2 

x +y +z = r 


so this line integral of the gradient gives the original function. 
Similarly, if v=k then 


u(p) =/ k x dr 
Jpo 

Consider any curve from 0 to p=(x, y’, z), given by r=r( s) with r(0)=0 and r(S)=p for some S, and 
do the above integral along that curve. 


u(p) 


fo kx Ts ds 

j[rjc(s)]o -i[fy(5)]o 

Px j - Pyi = X) - yi 


and curl u is 
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1 

2 


i 

D x 

-y 


j 


D v 


k 

D z 

0 


= k = v 


as expected. 

We will soon see that these three integrals do not depend on the path, apart from a constant. 


54.5.6 Surface and Volume Integrals 

Just as with curves, it is possible to parameterise surfaces then integrate over those parameters 
without regal'd to geometry of the surface. 

That is, to integrate a scalar function V over a surface A parameterised by r and s we calculate 

/ V{x,y,z)dS = / / V{r,s)detJ drds 

Ja J Ja 

where J is the Jacobian of the transformation to the parameters. 

To integrate a vector this way, we integrate each component separately. 

However, in three dimensions, every surface has an associated normal vector n, which can be used 
in integration. We write dS=ndS. 

For a scalar function, V, and a vector function, v, this gives us the integrals 

f V dS f v • dS f v x dS 

Ja Ja Ja 

These integrals can be reduced to parametric integrals but, written this way, it is clear that they 
reflect more of the geometry of the surface. 

When working in three dimensions, dV is a scalar, so there is only one option for integrals over 
volumes. 
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54.5.7 Gauss’s divergence theorem 

zi 



Figure 78 

We know that, in one dimension. 


f Dfdx = f\ b a 

J a 

Integration is the inverse of differentiation, so integrating the differential of a function returns the 
original function. 

This can be extended to two or more dimensions in a natural way, drawing on the analogies between 
single variable and multivariable calculus. 

The analog of Dis &nabla;, so we should consider cases where the integrand is a divergence. 

Instead of integrating over a one-dimensional interval, we need to integrate over a u-dimensional 
volume. 

In one dimension, the integral depends on the values at the edges of the interval, so we expect the 
result to be connected with values on the boundary. 

This suggests a theorem of the form. 


/ V -udV = / n urfS 

JV JdV 
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This is indeed true, for vector fields in any number of dimensions. 
This is called Gauss’s theorem. 

There are two other, closely related, theorems for grad and curl: 

• f v S/udV = f gv undS, 

• f v V x udV = f dv n x udS, 

with the last theorem only being valid where curl is defined. 

54.5.8 Stokes’ curl theorem 


Figure 79 



WIKIVERSITY 


These theorems also hold in two dimensions, where they relate surface and line integrals. Gauss’s 
divergence theorem becomes 


| V udS = f n u ds 

JS JdS 

where s is arclength along the boundary curve and the vector n is the unit normal to the curve that 
lies in the surface S, i.e. in the tangent plane of the surface at its boundary, which is not necessarily 
the same as the unit normal associated with the boundary curve itself. 

Similarly, we get 


/ V x u dS = I n 

Js Jc 


x u ds 


( 1 ) 
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where C is the boundary of S 

In this case the integral does not depend on the surface S. 

To see this, suppose we have different surfaces, Si and S 2 , spanning the same curve C, then by 
switching the direction of the normal on one of the surfaces we can write 

/ Vxu dS= / VxurfS- / VxurfS (2) 

Js!+s 2 Js Js 

The left hand side is an integral over a closed surface bounding some volume V so we can use 
Gauss’s divergence theorem. 


/ Vxu dS= / V-Vxu dV 

JS1 + S2 Jv 

but we know this integrand is always zero so the right hand side of (2) must always be zero, i.e. the 
integral is independent of the surface. 

This means we can choose the surface so that the normal to the curve lying in the surface is the 
same as the curves intrinsic normal. 

Then, if u itself lies in the surface, we can write 


u = (u • n) n + (u • t) t 

just as we did for line integrals in the plane earlier, and substitute this into (1) to get 


I V x u dS = / u- dr 
Js Jc 


This is Stokes’ curl theorem 
1 


1 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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Ordinary differential equations involve equations containing: 

• variables 

• functions 

• their derivatives 

and their solutions. 

In studying integration, you already have considered solutions to very simple differential equations. 
For example, when you look to solving 



dx = g(x) 


for g(x), you are really solving the differential equation 


g'(x) = /(x) 


55.1 Notations and terminology 


The notations we use for solving differential equations will be crucial in the ease of solubility for 
these equations. 

This document will be using three notations primarily: 

• f’ to denote the derivative of f 

• D /to denote the derivative of / 

• to denote the derivative of /(for separable equations). 


55.1.1 Terminology 

Consider the differential equation 


3/"(x) + 5x/(x) = ll 

Since the equation’s highest derivative is 2, we say that the differential equation is of order 2. 
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55.2 Some simple differential equations 

A key idea in solving differential equations will be that of INTEGRATION 1 . 

Let us consider the second order differential equation (remember that a function acts on a value). 


fix) = 2 


How would we go about solving this? It tells us that on differentiating twice, we obtain the constant 
2 so, if we integrate twice, we should obtain our result. 

Integrating once first of all: 



(x) dx = 


/ 


2 dx 


fix) =2x + C\ 

We have transformed the apparently difficult second order differential equation into a rather simpler 
one, viz. 


fix) = 2x + Ci 

This equation tells us that if we differentiate a function once, we get 2x + Ci. If we integrate once 
more, we should find the solution. 



(x) dx = 


/ 


2x + Ci dx 


f (x) — f + Ci x + C 2 

This is the solution to the differential equation. We will get f" = 2 for all values of Ci and C 2 . 

The values Ci and C 2 arc related to quantities known as initial conditions. 

Why arc initial conditions useful? ODEs (ordinary differential equations) arc useful in modeling 
physical conditions. We may wish to model a certain physical system which is initially at rest 
(so one initial condition may be zero), or wound up to some point (so an initial condition may be 
nonzero, say 5 for instance) and we may wish to see how the system reacts under such an initial 
condition. 

When we solve a system with given initial conditions, we substitute them after our process of 
integration. 


1 http://en.wikibooks.org/wiki/Calculus%2FDefinite%20integral 


348 



Basic first order DEs 


55.2.1 Example 

When we solved fix) = 2 say we had the initial conditions /'( 0) = 3 and /( 0) = 2. (Note, initial 
conditions need not occur at f(0)). 

After we integrate we make substitutions: 

/'(()) = 2(0) + Ci 


3 = Ci 



ix)dx = 


/ 


2x + 3 dx 


f{x) - x 2 + 3x + C 2 


/(0) = 0 2 + 3(0) + C 2 


2 = C 2 


fix) = x 2 + 3x + 2 

Without initial conditions, the answer we obtain is known as the general solution or the solution to 
the family of equations. With them, our solution is known as a specific solution. 


55.3 Basic first order DEs 

In this section we will consider four main types of differential equations: 

• separable 

• homogeneous 

• linear 

• exact 

There are many other forms of differential equation, however, and these will be dealt with in the 
next section 
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55.3.1 Separable equations 

A separable equation is in the form (using dy/dx notation which will serve us greatly here) 

Previously we have only dealt with simple differential equations with g(y)=l. How do we solve 
such a separable equation as above? 

We group x and dx terms together, and y and dy terms together as well. 


g(y) dy = fix) dx 

Integrating both sides with respect to y on the left hand side and x on the right hand side: 


f g(y ) d y = J 


fix ) dx + C 


we will obtain the solution. 


Worked example 

Here is a worked example illustrating the process. 
We arc asked to solve 


dx 


Separating 


Integrating 


— = (3x 2 ) dx 

y 




lny = x 3 + C 


y = e 


,x 3 +C 


Letting k = e c where k is a constant we obtain 
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y = ke x 


which is the general solution. 


Verification 

This step does not need to be paid of your work, but if you want to check your solution, you can 
verify your answer by differentiation. 

We obtained 


y = lce x3 


as the solution to 


dy 

dx 


= 3 x 2 y 


Differentiating our solution with respect to x, 


A =3 

dx 


And since y=ke 


we can write 


dx 


We see that we obtain our original differential equation, thus our work must be correct. 


55.3.2 Homogeneous equations 

A homogeneous equation is in the form 


dy_ 

dx 


= f(yix) 


This looks difficult as it stands, however we can utilize the substitution 



so that we are now dealing with F(v) rather than F(y/x). 

Now we can express y in terms of v, as y=xv and use the product rule. 
The equation above then becomes, using the product rule 
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dy _ dv 
dx V + X dx 


Then 


dv 

v + x— = f(v) 
dx 


dv 

x— = f{v)-v 
dx 


dv _ f[v) - v 
dx x 

which is a separable equation and can be solved as above. 

However let’s look at a worked equation to see how homogeneous equations arc solved. 


Worked example 

We have the equation 


dy _ y 2 + x 2 
dx yx 

This does not appear to be immediately separable, but let us expand to get 

dy = z!_+ 

dx yx yx 


dy x y 
dx y x 


Substituting y=xv which is the same as substituting v=ylx: 


dy 

dx 


= l/v+v 


Now 


dv 

v + x— = l/v+v 
dx 


Canceling v from both sides 
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dv 

x— — = II v 
dx 


Separating 


Integrating both sides 


vdv = dxlx 


1 9 

- v 2 + C = ln(x) 

2 

i(^ lnw -c 

y 2 = 2x 2 ln(x) - 2Cx 2 


y = x\/2ln(x) -2 C 


which is our desired solution. 


55.3.3 Linear equations 

A linear first order differential equation is a differential equation in the form 

dy 

a(x) — + b(x)y = c(x) 
dx 

Multiplying or dividing this equation by any non-zero function of x makes no difference to its 
solutions so we could always divide by a(x ) to make the coefficient of the differential 1, but writing 
the equation in this more general form may offer insights. 

At first glance, it is not possible to integrate the left hand side, but there is one special case. If b 
happens to be the differential of a then we can write 


dy dy dad 

£?(x)— + o(x)y = a(x)— + y— = — a(x)y 
dx dx dx dx 


and integration is now straightforward. 

Since we can freely multiply by any function, lets see if we can use this freedom to write the left 
hand side in this special form. 

We multiply the entire equation by an arbitrary, 7(x), getting 
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dy , 

al— + blv - cl 
dx 


then impose the condition 


d 

— al=bl 
dx 


If this is satisfied the new left hand side will have the special form. Note that multiplying /by any 
constant will leave this condition still satisfied. 


Rearranging this condition gives 


Idl _ b-f x 
I dx a 

We can integrate this to get 

, f b{z) , , k rm d7 

ln/(x) = I dz-\na{x) + c /(x) = e-’ 

J a(z ) a(x) 

We can set the constant kto be 1, since this makes no difference. 

Next we use / on the original differential equation, getting 

jm dz dy jm d zb(x) jm dz c(x) 

qJ a(z) 1- QJ a(z) V = a{z ) 

dx a{x ) a{x) 

Because we’ve chosen I to put the left hand side in the special form we can rewrite this as 

— (ye J a < z ) j = e J a < 2 ) 

dx a{x) 

Integrating both sides and dividing by / we obtain the final result 

r b(z) j I r rMzl dz c(x) ) 

y=e Jaiz) az \l e J a{z) dz dx+C\ 

[J aCx) ) 

We call I an integrating factor. Si mi lar techniques can be used on some other calclulus problems. 

Example 

Consider 


dy 

— + ytanx= 1 
dx 


y( 0 ) = 0 
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First we calculate the integrating factor. 

I = e f tanxdx = gin sec x _ secx 

Multiplying the equation by this gives 


dy 

secx— + ysecxtanx = secx 
dx 


or 


We can now integrate 


d 

— ysecx = secx 
dx 


y- cosx seczdz = cosxln(secx + tanx) 
Jo 


55.3.4 Exact equations 

An exact equation is in the form 
f(x, y)dx+g(x, y)dy=0 
and, has the property that 
D x f = D y g 

(If the differential equation does not have this property then we can’t proceed any further). 

As a result of this, if we have an exact equation then there exists a function h(x, y) such that 
D y h = f and D x h = g 
So then the solutions are in the form 
h(x, y) = c 

by using the fact of the total differential. We can find then h(x, y) by integration 

55.4 Basic second and higher order ODE’s 

The generic solution of a n lh order ODE will contain n constants of integration. To calculate them 
we need n more equations. Most often, we have either 

boundary conditions, the values of y and its derivatives take for two different values of 
x 

or 
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initial conditions, the values of y and its first n-1 derivatives take for one particular 
value of x. 


55.4.1 Reducible ODE’s 


1. If the independent variable, x, does not occur in the differential equation then its order can be 
lowered by one. This will reduce a second order ODE to first order. 


Consider the equation: 


F 


' dy d 2 y' 
dx’ dx 2 , 


= 0 


Define 



Then 


d 2 y _ du _du dy _ du 
dx 2 dx dy dx dy 

Substitute these two expression into the equation and we get 


F 


du 

y, u, • u 
dy 


=0 


which is a first order ODE 


Example 


Solve 


1 + 2y 2 D 2 y = 0 


if at x=0, y=Dy= 1 

First, we make the substitution, getting 


o du 

1 + 2 y z u— = 0 
dy 


This is a first order ODE. By rearranging terms we can separate the variables 
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udu- 

2y 2 


Integrating this gives 


u 2 12 = c+ l/2y 

We know the values of y and u when x=0 so we can find c 

c = u 2 /2 — l/2y = l 2 12 — 1/(2 • 1) = 112-1/2 = 0 
Next, we reverse the substitution 


dy 2 , 1 

-f =u 2 = - 

dx y 

and take the square root 

dy = + J_ 
dx ~ y/y 

To find out which sign of the square root to keep, we use the initial condition, Dp= I at x=0, again, 
and rule out the negative square root. We now have another separable first order ODE, 

dy _ 1 
dx sjy 

Its solution is 


2 5 - 
- y 2 = x + d 


Since y= 1 when x=0, d= 2/3, and 


2. If the dependent variable, y, does not occur in the differential equation then it may also be reduced 
to a first order equation. 

Consider the equation: 


I dy d 2 y 
' ’ dx’ dx 2 


Define 
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Then 



d 2 y du 
dx 2 dx 


Substitute these two expressions into the first equation and we get 


=0 

which is a first order ODE 


du 
x, u, — - 
. dx 


55.4.2 Linear ODEs 

An ODE of the form 


d n y d n l y 

—— + fli(x) - + ... + a n y = F(x) 

dx n dx n ~ l 

is called linear. Such equations are much simpler to solve than typical non-linear ODEs. Though 
only a few special cases can be solved exactly in terms of elementary functions, there is much that 
can be said about the solution of a generic linear ODE. A full account would be beyond the scope 
of this book 

If F(x)=0 for all x the ODE is called homogeneous 
Two useful properties of generic linear equations are 

1 . Any lineal - combination of solutions of an homogeneous linear equation is also a solution. 

2. If we have a solution of a nonhomogeneous linear equation and we add any solution of the 
coii'esponding homogenous linear equation we get another solution of the nonhomogeneous 
lineal' equation 


Variation of constants 

Suppose we have a linear ODE, 


d n y d n l y 

d^ + aiiX) d^ + - + any = ° 

and we know one solution, y=w(x) 

The other solutions can always be written as y-wz. This substitution in the ODE will give us terms 
involving every differential of z upto the n lh , no higher, so we’ll end up with an n th order linear 
ODE for z. 
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We know that z is constant is one solution, so the ODE for z must not contain a z term, which means 
it will effectively be an n-l lh order linear ODE. We will have reduced the order by one. 


Lets see how this works in practice. 


Example 


Consider 


d 2 y 2 dv 6 

^4 + --r---?y=o 
dx z x dx x z 


One solution of this is y=x 2 , so substitute y~zx? into this equation. 


I 


\ 


, d 2 z 


dz 


x“ — r- + 2x l-2z 

dx z dx 


2 

+ - 
x 


n dz 

x — + 2xz 
dx 


6 2 

— x 2 z = 0 
x z 


Rearrange and simplify. 


x 2 D 2 z + 6xDz = 0 


This is first order for Dz. We can solve it to get 


z = Ax 5 y = Ax 3 


Since the equation is linear we can add this to any multiple of the other solution to get the general 
solution. 


y- Ax 3 + Bx 2 


Linear homogeneous ODE’s with constant coefficients 

Suppose we have a ODE 

{D n + cii D n 3 + ... + u n —\D + floly = 0 
we can take an inspired guess at a solution (motivate this) 

y = e px 

For this function D n y=p n y so the ODE becomes 

[p n + flip” -1 + ... + fl„_ip + flo)y = 0 


359 



Ordinary Differential Equations 


y=0 is a trivial solution of the ODE so we can discard it. We are then left with the equation 

p n + d\ p n ~ l + ... + a„-ip + ao ) = 0 
This is called the characteristic equation of the ODE. 

It can have up to n roots, pi , P 2 &hellip; p n , each root giving us a different solution of the ODE. 

Because the ODE is linear, we can add all those solution together in any linear combination to get a 
general solution 


y = Aie PlX + A 2 e P2X + ... + A n e PnX 

To see how this works in practice we will look at the second order case. Solving equations like this 
of higher order uses exactly the same principles; only the algebra is more complex. 

Second order 

If the ODE is second order, 


D 2 y + bDy + cy = 0 

then the characteristic equation is a quadratic, 


with roots 


p 2 + bp + c = 0 


-b ± \J b 2 - 4c 
P± = 2 

What these roots are like depends on the sign of Ir-Ac, so we have three cases to consider. 

1) b 2 > 4c 

In this case we have two different real roots, so we can write down the solution straight away. 


y = A+ e p+ + A- e p - 


2) b 2 <4c 

In this case, both roots arc imaginary. We could just put them directly in the formula, but if we arc 
interested in real solutions it is more useful to write them another way. 

Defining k 2 =4c-b 2 , then the solution is 


jh. Y _bx — i Jr y— — 

y = A + e ^ 2 + A- e llcx 2 
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For this to be real, the A’s must be complex conjugates 


A+ = Ae ±ia 

Make this substitution and we can write, 

y = Ae~ bxl2 cos (fcjc + a) 

If b is positive, this is a damped oscillation. 

3) b 2 -4c 

In this case the characteristic equation only gives us one root, p=-b/2. We must use another method 
to find the other solution. 

We’ll use the method of variation of constants. The ODE we need to solve is, 

D 2 y-2pDy + p 2 y = 0 

rewriting b and c in terms of the root. From the characteristic equation we know one solution is 
y _ e px SQ we ma | ce the substitution y = ze px , giving 

ief ,x D 2 z + 2peP x Dz + p 2 eP x z)-2p{eP x Dz + pe px z) + p 2 e px z = 0 
This simplifies to D 2 z=0, which is easily solved. We get 


z = Ax + B y= [Ax + B)e px 
so the second solution is the first multiplied by x. 

Higher order linear constant coefficient ODE’s behave similarly: an exponential for every real root 
of the characteristic and a exponent multiplied by a trig factor for every complex conjugate pair, 
both being multiplied by a polynomial if the root is repeated. 

E.g., if the characteristic equation factors to 


(p-l) 4 (p-3)(p 2 + 1) 2 = 0 
the general solution of the ODE will be 

y = (A + Bx + Cx 2 + Dx 3 )e x + Ee 3x + Fcos(x + a) + Gxcos(x + b) 
The most difficult part is finding the roots of the characteristic equation. 
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Linear nonhomogeneous ODEs with constant coefficients 

First, let’s consider the ODE 


Dy-y=x 

a nonhomogeneous first order ODE which we know how to solve. 
Using the integrating factor e~ x we find 


y=ce x + 1- x 

This is the sum of a solution of the corresponding homogeneous equation, and a polynomial. 
Nonhomogeneous ODE’s of higher order behave similarly. 

If we have a single solution, y p of the nonhomogeneous ODE, called a particular solution, 

[D n + ai D n ~ l + • • • + a n )y = F(x) 

then the general solution is y=y p +yh, where y/, is the general solution of the homogeneous ODE. 

Find y p for an arbitrary F(x) requires methods beyond the scope of this chapter, but there are some 
special cases where finding y p is straightforward. 

Remember that in the first order problem y p for a polynomial F(x) was itself a polynomial of the 
same order. We can extend this to higher orders. 

Example: 


Consider a particular solution 


D 2 y + y = x 3 -x + 1 


y p = bo + bix + b 2 X 2 + x 3 
Substitute for y and collect coefficients 

x 3 + b 2 X 2 + (6+ b\)x + ( 2^2 + bo) = x 3 -x + 1 
So b2-0, bi--7, bo-1, and the general solution is 

y = asinx + bcosx+ 1 - 7x + x 3 

This works because all the derivatives of a polynomial are themselves polynomials. 
Two other special cases are 
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Fix) = P n e kx y p (x) = Q n e kx 

F (x) = A„ sin kx + B n cos lex y p (x) = P„ sin kx+Q„ cos kx 
where P n ,Q n ,A n , and B n are all polynomials of degree n. 

Making these substitutions will give a set of simultaneous linear equations for the coefficients of the 
polynomials. 

2 


2 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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56 Partial Differential Equations 


56.1 Introduction 

56.2 First order 


Any partial differential equation of the form 


du 


+ /?2 - 


du 


dx\ “ 3x2 


■ ' • • + Hr 


du 
' dx, 


= b 


where h\, hq &hellip; h a , and b are all functions of both u and R n can be reduced to a set of ordinary 
differential equations. 

To see how to do this, we will first consider some simpler problems. 


56.2.1 Special cases 

We will start with the simple PDE 


u z {x,y,z] = u{x,y,z) (1) 

Because u is only differentiated with respect to z, for any fixed x and y we can treat this like the 
ODE, du/dz=u. The solution of that ODE is C(f\ where c is the value of u when z=0, for the fixed x 
andy 

Therefore, the solution of the PDE is 


u{x,y,z) = u(x,y, 0)e z 

Instead of just having a constant of integration, we have an arbitrary function. This will be true for 
any PDE. 

Notice the shape of the solution, an arbitrary function of points in the xy, plane, which is normal to 
the ’z’ axis, and the solution of an ODE in the ’z’ direction. 

Now consider the slightly more complex PDE 


a x u x + a y u y + a z u z = h{u) (2) 
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where h can be any function, and each a is a real constant. 

We recognize the left hand side as being a&nabla;, so this equation says that the differential of u 
in the a direction is h(u). Comparing this with the first equation suggests that the solution can be 
written as an arbitrary function on the plane normal to a combined with the solution of an ODE. 

Remembering from Calculus/Vectors 1 that any vector r can be split up into components par- 
allel and perpendicular to a , 


' (r-a)al (r-a)a 

r = r_i + rii = r — H — 

" lal 2 J |a| 2 

we will use this to split the components of r in a way suggested by the analogy with (1). 
Let’s write 


ra 

r = (x, y, z) = rj_ + 5a s = 

a- a 

and substitute this into (2), using the chain rule. Because we arc only differentiating in the a direc- 
tion, adding any function of the perpendicular vector to s will make no difference. 

First we calculate grad s, for use in the chain rule, 



On making the substitution into (2), we get, 


h[u) = a- Vs 


d_ 

ds 


u(s) = 


a • a cl du 

— — u{s) = — 

3 • 3 Cl S CIS 


which is an ordinary differential equation with the solution 


s = c(rj_) + 



dt 

h{t) 


The constant c can depend on the perpendicular components, but not upon the parallel coordinate. 
Replacing s with a monotonic scalar function of 5 multiplies the ODE by a function of 5, which 
doesn’t affect the solution. 

Example: 


u{x, t) x = uix, t) t 

For this equation, a is (1, -1), s-x-t, and the perpendicular vector is (x+t)(l, 1). The reduced ODE 
is dulds-0 so the solution is 

u=f(x+t) 


1 Chapter 52 on page 295 
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To find /we need initial conditions on u. Arc there any constraints on what initial conditions arc 
suitable? 

Consider, if we arc given 

• u(x.O). this is exactly fix), 

• u{ 3 1, t ) , this is fiAt) and fi t) follows immediately 

• u(f ? +2r,f), this is fit’+3t) and fit) follows, on solving the cubic. 

• u{-t,t), then this is /(()), so if the given function isn’t constant we have a inconsistency, and if it is 
the solution isn’t specified off the initial line. 

Similarly, if we arc given u on any curve which the lines x+t-c intersect only once, and to which 
they arc not tangent, we can deduce fi 

For any first order PDE with constant coefficients, the same will be true. We will have a set of lines, 
parallel to /-at. along which the solution is gained by integrating an ODE with initial conditions 
specified on some surface to which the lines aren’t tangent. 

If we look at how this works, we’ll see we haven’t actually used the constancy of a, so let’s drop 
that assumption and look for a similar solution. 

The important point was that the solution was of the form u=fix(s),y(s)), where (x(s),y(s)) is the 
curve we integrated along — a straight line in the previous case. We can add constant functions of 
integration to 5 without changing this form. 

Consider a PDE, 


a{x, y ) u x + b{x, y ) u y = c(x, y, u) 


For the suggested solution, u=fix{s),y{s)), the chain rule gives 


du 

ds 


dx dy 

U x H U 

ds ' ds 


y 


Comparing coefficients then gives 


dx dy 

— = a{x,y) — = b(x,y ) 

CIS CIS 


du 

— = c(x, y, u ) 
ds 


so we’ve reduced our original PDE to a set of simultaneous ODE’s. This procedure can be reversed. 
The curves (x(.s'),Ks)j are called characteristics of the equation. 

Example: Solve yu x = xu v given 11 = fi x) for x>0 The ODE’s are 


dx dy du 

ds ds ds 


subject to the initial conditions at s=0. 


x(0) = r y(0) = 0 u{ 0) = f{r) r > 0 
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This ODE is easily solved, giving 


x(s) = rcoss y{s) = r sins u(s) = f[r) 

so the characteristics are concentric circles round the origin, and in polar coordinates u(r,9)=J{r) 

Considering the logic of this method, we see that the independence of a and b from u has not been 
used either, so that assumption too can be dropped, giving the general method for equations of this 
quasilinear form. 


56.2.2 Quasilinear 

Summarising the conclusions of the last section, to solve a PDE 

du du du 

a,] [u,x] 1 a2(u,x) + a n (u,x) = b(u,x) 

dx i 0x2 dx n 

subject to the initial condition that on the surface, (xi(ri,&hellip;,q^i, &hellip;x n (ri,&hellip;,q 7 _i), 
u=f( q ,&hellip;,q 7 _| ) —this being an arbitrary paremetrisation of the initial surface— 

• we transform the equation to the equivalent set of ODEs, 



dx n 

ds 


— a n 


du 

— = b 
ds 


subject to the initial conditions 

x/(0) = /(n,...,r„_i) w = /(ri,r 2 ,...r n _i) 

• Solve the ODE’s, giving x,- as a function of s and the q. 

• Invert this to get 5 and the q as functions of the x,-. 

• Substitute these inverse functions into the expression for u as a function of s and the q obtained 
in the second step. 

Both the second and third steps may be troublesome. 

The set of ODEs is generally non-linear and without analytical solution. It may even be easier to 
work with the PDE than with the ODEs. 

In the third step, the q together with s form a coordinate system adapted for the PDE. We can only 
make the inversion at all if the Jacobian of the transformation to Cartesian coordinates is not zero. 


dxi dx-i 

dri ‘ ‘ ‘ dr„_! 

dx n dx n 

dri dr„_! 


U\ 


7^ 0 


<2/7 
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This is equivalent to saying that the vector (a ; , &hellip:, a n ) is never in the tangent plane to a surface 
of constant 5 . 

If this condition is not false when 5=0 it may become so as the equations arc integrated. We will 
soon consider ways of dealing with the problems this can cause. 

Even when it is technically possible to invert the algebraic equations it is obviously inconvenient to 
do so. 


Example 

To see how this works in practice, we will 


a / consider the PDE, 


uu x + u y + u t = 0 


with generic initial condition. 


u = fix, y) on t = 0 

Naming variables for future convenience, the corresponding ODE’s are 

dx dy 1 dt 1 du 

dx dx dx dx 

subject to the initial conditions at r=0 


x = r y = s t = 0 u = fir, 5 ) 


These ODE’s arc easily solved to give 


x=r + fir,s)x y = s + x t=x u = fir,s ) 

These arc the parametric equations of a set of straight lines, the characteristics. 
The determinant of the Jacobian of this coordinate transformation is 


1 + T^ 
1 + T dr 


0 

0 



1 

0 


/ 

1 

1 


df 
l + T ~dr 


This determinant is 1 when f=0, but if f- is anywhere negative this determinant will eventually be 
zero, and this solution fails. 

In this case, the failure is because the surface sf r = — 1 is an envelope of the characteristics. 

For arbitrary /we can invert the transformation and obtain an implicit expression for u 
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u = fix- tu,y-x) 


If /is given this can be solved for u. 

1/ fix, y) = ax, The implicit solution is 


u = aix - tu) => u= 

1 + at 

This is a line in the u-x plane, rotating clockwise as l increases. If a is negative, this line eventually 
become vertical. If a is positive, this line tends towards u= 0, and the solution is valid for all t. 

2! fx,y)=xr, The implicit solution is 


? 1 + 2 tx- \/l + 4 tx 

u=ix- tu) => u = = 

2t 2 

This solution clearly fails when 1 + 4 1 x < 0, which is just when sf r = - 1 . For any t > 0 this happens 
somewhere. As t increases this point of failure moves toward the origin. 

Notice that the point where u= 0 stays fixed. This is true for any solution of this equation, whatever 
/is. 

We will see later that we can find a solution after this time, if we consider discontinuous solutions. 
We can think of this as a shockwave. 

3/ fix,y) = sin(xy) 

The implicit solution is 


uix, y, t) = sin ((x - t u) iy - x)) 

and we can not solve this explitely for u. The best we can manage is a numerical solution of this 
equation. 

b/We can also consider the closely related PDE 


uu x + Uy+u t = y 


The corresponding ODE’s are 


dx _ dy _ dz 

dx dx dx 



subject to the initial conditions at r=0 


x=r y = s t = 0 u = fir,s) 
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These ODE’s are easily solved to give 

x = r + rf+ -ST 2 + -t 3 v = s + t t = r u=f + ST + -T 2 
J 2 6 2 

Writing /in terms of u, s, and r, then substituting into the equation for x gives an implicit solution 

u{x,y, t ) = fix - ut+ -yt 2 -~t 3 ,y-t) + yt--t 2 
2 6 2 


It is possible to solve this for u in some special cases, but in general we can only solve this equation 
numerically. However, we can learn much about the global properties of the solution from further 
analysis 


Characteristic initial value problems 

What if initial conditions are given on a characteristic, on an envelope of characteristics, on a surface 
with characteristic tangents at isolated points? 


Discontinuous solutions 

So far, we’ve only considered smooth solutions of the PDE, but this is too restrictive. We may 
encounter initial conditions which aren’t smooth, e.g. 


u t = cu x u(x, 0) 


r i, x > o 
jo, x<0 


If we were to simply use the general solution of this equation for smooth initial conditions. 


w(x, t ) = u{x + ct, 0) 


we would get 


u{x , t ) 


jl, x + cf>0 
jo, x+ct<0 


which appears to be a solution to the original equation. However, since the part ial differentials are 
undefined on the characteristic x+ct= 0, so it becomes unclear what it means to say that the equation 
is true at that point. 

We need to investigate further, starting by considering the possible types of discontinuities. 

If we look at the derivations above, we see we’ve never use any second or higher order derivatives 
so it doesn’t matter if they aren’t continuous, the results above will still apply. 

The next simplest case is when the function is continuous, but the first derivative is not, e.g. Ixl. 
We’ll initially restrict ourselves to the two-dimensional case, u{x, t) for the generic equation. 
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a(x,t)u x + b{x,t)u t = c{u,x,t) (1) 

Typically, the discontinuity is not confined to a single point, but is shared by all points on some 
curve, (jgo(s). Us ) ) 

Then we have 


x>x 0 lim XXo = u+ 
x < x 0 lim XXo = U - 

We can then compare u and its derivatives on both sides of this curve. 

It will prove useful to name the jumps across the discontinuity. We say 

[u\ = u+-u- [u x ] = u x+ - u x - [u t ] = u t+ -u t - 

Now, since the equation (1) is true on both sides of the discontinuity, we can see that both u + and 
il, being the limits of solutions, must themselves satisfy the equation. That is, 

a(x, t)u +x + b(x, t)u+ t = c{u+,x,t) , x = xo(s) 

Wnprp 

a{x,t)u- x + b{x,t)u-t = c{u-,x,t) f=fo(s) 

Subtracting then gives us an equation for the jumps in the differentials 


a{x, t)[u x ] + b{x, t)[u t ] = 0 

We are considering the case where u itself is continuous so we know that [ «]=(). Differentiating this 
with respect to 5 will give us a second equation in the differential jumps. 


dx o 
ds 


[u x \ + 


dx o 
dt 


[u t ] =0 


The last two equations can only be both true if one is a multiple of the other, but multiplying s 
by a constant also multiplies the second equation by that same constant while leaving the curve of 
discontinuity unchanged, hence we can without loss of generality define 5 to be such that 


dx o dto 

~d7 = a ~Ts =b 

But these are the equations for a characteristic, i.e. discontinuities propagate along characteris- 
tics. We could use this property as an alternative definition of characteristics. 

We can deal similarly with discontinuous functions by first writing the equation in conservation 
form , so called because conservation laws can always be written this way. 


(au) x + {bu) t = a x u+b t u + c (1) 
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Notice that the left hand side can be regarded as the divergence of ( au , bu). Writing the equation 
this way allows us to use the theorems of vector calculus. 

Consider a narrow strip with sides parallel to the discontinuity and width h 

We can integrate both sides of (1) over R, giving 

/ ( au) x + [bu) t dxdt = / [a x + b r )u + cdxdt 
JR JR 

Next we use Green’s theorem to convert the left hand side into a line integral. 


<p audt- budx= I [a x + b t )u + cdxdt 
JdR JR 

Now we let the width of the strip fall to zero. The right hand side also tends to zero but the left hand 
side reduces to the difference between two integrals along the part of the boundary of R parallel to 
the curve. 


/' 


-I 


au+dt- bu+dx- I au-dt - bu-dx = 0 


The integrals along the opposite sides of R have different signs because they are in opposite direc- 
tions. 

For the last equation to always be true, the integrand must always be zero, i.e. 


/ 




d to dx o 1 

a lTs~ Hi 


l u ] = 0 


Since, by assumption [ u\ isn’t zero, the other factor must be, which immediately implies the curve 
of discontinuity is a characteristic. 

Once again, discontinuities propagate along characteristics. 

Above, we only considered functions of two variables, but it is straightforward to extend this to 
functions of n variables. 

The initial condition is given on an n - 1 dimensional surface, which evolves along the characteristics. 
Typical discontinuities in the initial condition will lie on a n - 2 dimensional surface embedded within 
the initial surface. This surface of discontinuity will propagate along the characteristics that pass 
through the initial discontinuity. 

The jumps themselves obey ordinary differential equations, much as u itself does on a characteristic. 
In the two dimensional case, for u continuous but not smooth, a little algebra shows that 


d[u x \ 

ds 


lu x ] 


' dc b x \ 

t a ~ ; — a x 

< du b ) 


while u obeys the same equation as before, 
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du 

~ds ~ C 

We can integrate these equations to see how the discontinuity evolves as we move along the charac- 
teristic. 

We may find that, for some future s, [ u x ] passes through zero. At such points, the discontinuity has 
vanished, and we can treat the function as smooth at that characteristic from then on. 

Conversely, we can expect that smooth functions may, under the righr circumstances, become dis- 
continuous. 

To see how all this works in practice we'll consider the solutions of the equation 


u r +uu x = 0 w(x,0) = /(x) 


for three different initial conditions. 

The general solution, using the techniques outlined earlier, is 


u = fix- tu) 

u is constant on the characteristics, which arc straight lines with slope dependent on u. 

First consider / such that 

{ 1 x > a 

| a > x > 0 a > 0 
0 x < 0 

While u is continuous its derivative is discontinuous at x=0, where w=0. and at x=a, where n= 1 . The 
characteristics through these points divide the solution into three regions. 

All the characteristics to the right of the characteristic through x— a, /=() intersect the x-axis to the 
right of x— I , where u= 1 so u is 1 on all those characteristics, i.e whenever x-t>a. 

Similarly the characteristic through the origin is the line x=0. to the left of which u remains zero. 

We could find the value of u at a point in between those two characteristics either by finding which 
intermediate characteristic it lies on and tracing it back to the initial line, or via the general solution. 

Either way, we get 

) 1 x - t> a 
~a+t a+t>x> 0 
0 x < 0 
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At larger t the solution u is more spread out than at £=0 but still the same shape. 

We can also consider what happens when a tends to 0, so that u itself is discontinuous at x=0. 

If we write the PDE in conservation form then use Green’s theorem, as we did above for the linear 
case, we get 


[u\ 


dx o 
ds 



d to 
ds 


| m 2 | is the difference of two squares, so if we take we get 


dx o 
dt 


1 

-(«_ + u+) 
2 


In this case the discontinuity behaves as if the value of u on it were the average of the limiting values 
on either side. 

However, there is a caveat. 

Since the limiting value to the left is u. the discontinuity must lie on that characteristic, and similarly 
for u+; i.e the jump discontinuity must be on an intersection of characteristics, at a point where u 
would otherwise be multivalued. 

For this PDE the characteristic can only intersect on the discontinuity if 


U- > u + 


If this is not true the discontinuity can not propagate. Something else must happen. 

The limit a= 0 is an example of a jump discontinuity for which this condition is false, so we can see 
what happens in such cases by studying it. 

Taking the limit of the solution derived above gives 


fix) = 



x > t 
t> x> 0 
x< 0 


If we had taken the limit of any other sequence of initials conditions tending to the same limit we 
would have obtained a trivially equivalent result. 

Looking at the characteristics of this solution, we see that at the jump discontinuity characteristics 
on which u takes every value betweeen 0 and 1 all intersect. 

At later times, there are two slope discontinuities, at x=0 and x=t, but no jump discontinuity. 

This behaviour is typical in such cases. The jump discontinuity becomes a pair of slope discontinu- 
ities between which the solution takes all appropriate values. 

Now, lets consider the same equation with the initial condition 
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1 1 x < 0 

fix ) = < 1- ^ a > x> 0 
l 0 x> a 


a > 0 


This has slope discontinuities at x=0 and x=a, dividing the solution into three regions. 

The boundaries between these regions are given by the characteristics through these initial points, 
namely the two lines 


x = t x = a 


These characteristics intersect at t= a, so the nature of the solution must change then. 
In between these two discontinuities, the characteristic through x=b at /=() is clearly 


x = (l \t + b 0 <b<a 


\ aj 

All these characteristics intersect at the same point, ( x,t)=(a,a ). 

We can use these characteristics, or the general solution, to write u for tea 


11 x< t 

u{x, t) = < a>x>t a>t>0 
l 0 x > a 

As t tends to a, this becomes a step function. Since u is greater to the left than the right of the 
discontinuity, it meets the condition for propagation deduced above, so for t>a u is a step function 
moving at the average speed of the two sides. 


f 1 x < 

i<(x,f) = j 0 x> a±r t>a> 0 

This is the reverse of what we saw for the initial condition previously considered, two slope discon- 
tinuities merging into a step discontinuity rather than vice versa. Which actually happens depends 
entirely on the initial conditions. Indeed, examples could be given for which both processes happen. 

In the two examples above, we started with a discontinuity and investigated how it evolved. It is 
also possible for solutions which are initially smooth to become discontinuous. 

For example, we saw earlier for this particular PDE that the solution with the initial condition it=x? 
breaks down when 2xi + 1 =0. At these points the solution becomes discontinuous. 

Typically, discontinuities in the solution of any partial differential equation, not merely ones of first 
order, arise when solutions break down in this way and propagate similarly, merging and splitting 
in the same fashion. 
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56.2.3 Fully non-linear PDEs 

It is possible to extend the approach of the previous sections to reduce any equation of the form 


Fix i,x 2 , . . .,x n , u, u Xv u X2 ,. . . , u Xn ) = 0 
to a set of ODE’s, for any function, F. 

We will not prove this here, but the corresponding ODE’s are 


dxi dF du,j 

dr duj dr 


dF d F\ 

— b lli — — 

[dxi du) 


du JL dF 

~r = E 

Cl T i=\ O lit 


If u is given on a surface parameterized by /'] &hcllip;r„ then we have, as before, n initial conditions 
on the n, Xj 


r = 0 xt = f i {r 1 ,r 2 ,...,r n - 1 ) 

given by the parameterization and one initial condition on u itself, 

r = 0 u = fir\, r 2 ,...,r n -i) 

but, because we have an extra n ODEs for the uf s, we need an extra n initial conditions. 
These arc, n- 1 consistency conditions. 


r = 0 


df 

dr\ 


n - 1 

E u i 

7=1 


d A 

Orj 


which state that the u{ s arc the part ial derivatives of u on the initial surface, and one initial condition 


T = 0 F[Xi,X 2 ,...,X n , u, Ml, u 2 ,...,u n ) =0 

stating that the PDE itself holds on the initial surface. 

These n initial conditions for the Uj will be a set of algebraic equations, which may have multiple 
solutions. Each solution will give a different solution of the PDE. 


Example 

Consider 


u t =u x +Uy, u{x,y,0) = x 2 + y 2 

The initial conditions at r=0 are 
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x = r y= s t = 0 u= r 2 + s 2 

u x = 2r u y = 2s u t = 4(r 2 + s 2 ) 

and the ODE’s arc 


dx 

dj 


= “2 Ur 


dy - _ 2 u 

dr ~ 


dt 

dr 


= 1 


d iix 
dr 


= 0 


du y _ n _ n 

dr _ u dr “ u 


= u t -2{u 2 x +u 2 ) 


Note that the partial derivatives arc constant on the characteristics. This always happen when the 
PDE contains only partial derivatives, simplifying the procedure. 

These equations arc readily solved to give 


x=r( 1-4t) y = s(l-4r) t = T u = (r 2 + s 2 )(l -4 t) 
On eliminating the parameters we get the solution. 


which can easily be checked, abc 


56.3 Second order 

Suppose we arc given a second order linear PDE to solve 

a{x, y ) u xx + b{x, y) u xy + c{x, y) u yy = d{x, y) u x + e(x, y) u y + p{x, y) u + q{x, y) (1) 

The natural approach, after our experience with ordinary differential equations and with simple 
algebraic equations, is attempt a factorisation. Let’s see how for this takes us. 

We would expect factoring the left hand of (1) to give us an equivalent equation of the form 


a{x, y) {D x + a+ ( x , y)D y ) (D x + a_ (x, y)D y ) u 

and we can immediately divide through by a. This suggests that those particular - combinations of 
first order derivatives will play a special role. 

Now, when studying first order PDE’s we saw that such combinations were equivalent to the deriva- 
tives along characteristic curves. Effectively, we changed to a coordinate system defined by the 
characteristic curve and the initial curve. 

Here, we have two combinations of first order derivatives each of which may define a different 
characteristic curve. If so, the two sets of characteristics will define a natural coordinate system for 
the problem, much as in the first order case. 

In the new coordinates we will have 
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D x + a+ (x, y)D y = D r D x + a- (x, y)D y = D s 

with each of the factors having become a differentiation along its respective characteristic curve, 
and the left hand side will become simply u rs giving us an equation of the form 


u rs = A(r,s)u r + B{r,s) u s + C{r,s)u + D{r,s) 

If A, B, and C all happen to be zero, the solution is obvious. If not, we can hope that the simpler 
form of the left hand side will enable us to make progress. 

However, before we can do all this, we must see if (1) can actually be factored. 

Multiplying out the factors gives 

u xx + a(x’y) u xy + a ( x ’y) u yy = u xx + (a+ + a_) u xy + a + a_ Uyy 

On comparing coefficients, and solving for the a’s we see that they are the roots of 


a{x,y)a 2 + b{x,y)a + c{x,y) = 0 

Since we arc discussing real functions, we arc only interested in real roots, so the existence of the 
desired factorization will depend on the discriminant of this quadratic equation. 

• If b{x,y) 2 > 4a(x,y)c{x,y) 

then we have two factors, and can follow the procedure outlined above. Equations like 
this arc called hyperbolic 

• If b{x,y) 2 = 4a(x,y)c(x,y) 

then we have only factor, giving us a single characteristic curve. It will be natural to 
use distance along these curves as one coordinate, but the second must be determined 
by other considerations. 

The same line of argument as before shows that use the characteristic curve this way 
gives a second order term of the form u rr , where we’ve only taken the second derivative 
with respect to one of the two coordinates. Equations like this arc called parabolic 

• If b{x,y) 2 < 4a{x,y)c(x,y) 

then we have no real factors. In this case the best we can do is reduce the second order 
terms to the simplest possible form satisfying this inequality, i.e u n -+ u ss 
It can be shown that this reduction is always possible. Equations like this are called 
elliptic 

It can be shown that, just as for first order PDEs, discontinuities propagate along characteristics. 
Since elliptic equations have no real characteristics, this implies that any discontinuities they may 
have will be restricted to isolated points; i.e., that the solution is almost everywhere smooth. 

This is not true for hyperbolic equations. Their behavior is largely controlled by the shape of their 
characteristic curves. 
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These differences mean different methods arc required to study the three types of second equation. 
Fortunately, changing variables as indicated by the factorisation above lets us reduce any second 
order PDE to one in which the coefficients of the second order terms arc constant, which means it 
is sufficient to consider only three standard equations. 


Uxx Uyy — 0 ^ I xx I ly y — 0 Oxx tly ~ 0 

We could also consider the cases where the right hand side of these equations is a given function, or 
proportional to u or to one of its first order derivatives, but all the essential properties of hyperbolic, 
parabolic, and elliptic equations are demonstrated by these three standard forms. 

While we’ve only demonstrated the reduction in two dimensions, a similar reduction applies in 
higher dimensions, leading to a si mi lar classification. We get, as the reduced form of the second 
order terms. 


d 2 i 


d 2 i 


a \ t t + ^2 t— y 
OXf OX 2 


H" • • * + CLtq 


d 2 u 
dx 2 


where each of the ay is equal to either 0, +1, or -1. 

If all the ay have the same sign the equation is elliptic 
If any of the ay are zero the equation is parabolic 

If exactly one of the ay has the opposite sign to the rest the equation is hyperbolic 

In 2 or 3 dimensions these arc the only possibilities, but in 4 or more dimensions there is a fourth 
possibility: at least two of the ay are positive , and at least two of the ay arc negative. 

Such equations arc called ultrahyperbolic. They arc less commonly encountered than the other 
three types, so will not be studied here. 

When the coefficients arc not constant, an equation can be hyperbolic in some regions of the xy 
plane, and elliptic in others. If so, different methods must be used for the solutions in the two 
regions. 


56.3.1 Elliptic 

Standard form, Laplace’s equation: 


V 2 h = 0 

Quote equation in spherical and cylindrical coordinates, and give full solution for cartesian and 
cylindrical coordinates. Note averaging property Comment on physical significance, rotation in- 
variance of laplacian. 
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56.3.2 Hyperbolic 

Standard form, wave equation: 

V 2 h = c 2 h tt 

Solution, any sum of functions of the form 

h = /(k-x-mf) to = |k|c 

These are waves. Compare with solution from separating variables. Domain of dependence, etc. 

56.3.3 Parabolic 

The canonical parabolic equation is the diffusion equation: 


M 2 h=h t 

Here, we will consider some simple solutions of the one-dimensional case. 

The properties of this equation arc in many respects intermediate between those of hyperbolic and 
elliptic equation. 

As with hyperbolic equations but not elliptic, the solution is well behaved if the value is given on 
the initial surface /=(). 

However, the characteristic surfaces of this equation arc the surfaces of constant f, thus there is no 
way for discontinuities to propagate to positive t. 

Therefore, as with elliptic equations but not hyberbolic, the solutions arc typically smooth, even 
when the initial conditions aren't. 

Furthermore, at a local maximum of h, its Laplacian is negative, so h is decreasing with t, while at 
local minima, where the Laplacian will be positive, h will increase with t. Thus, initial variations in 
h will be smoothed out as t increases. 

In one dimension, we can learn more by integrating both sides, 


f-a h tdt = f-a h xxdx 

Ttf- a hdt = lhxt a 

Provided that h x tends to zero for large x, we can take the limit as a and b tend to infinity, deducing 

d , , 

— hdt 

Cl t J—oo 
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so the integral of h over all space is constant. 

This means this PDE can be thought of as describing some conserved quantity, initially concentrated 
but spreading out, or diffusing, over time. 

This last result can be extended to two or more dimensions, using the theorems of vector calculus. 

We can also differentiate any solution with respect to any coordinate to obtain another solution. E.g. 
if h is a solution then 


V 2 h x = d x V 2 h = d x d t h = d t h x 
so h x also satisfies the diffusion equation. 


Similarity solution 

Looking at this equation, we might notice that if we make the change of variables 


r = ax r = a 2 1 

then the equation retains the same form. This suggests that the combination of variables x 2 /?, which 
is unaffected by this variable change, may be significant. 

We therefore assume this equation to have a solution of the special form 

x 

h{x, t) = f(rj) where p = — 

1 2 


then 


h x — Vxfrj — t 2 frj ht -Tit ft)- f 7 

and substituting into the diffusion equation eventually gives 

fiv + 2 fn = 0 

which is an ordinary differential equation. 

Integrating once gives 


_rf 

f v = Ae 4 

Reverting to h, we find 
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h x = ^ 

h = ^f^e-^ds + B 
= Af^e^dz + B 

This last integral can not be written in terms of elementary functions, but its values are well known. 
In particular the limiting values of h at infinity are 

h(-oo,t)=B h(oo, t) = B + A\Jh, 
taking the limit as t tends to zero gives 

f B x < 0 
1 \B + As/n x>0 

We see that the initial discontinuity is immediately smoothed out. The solution at later times retains 
the same shape, but is more stretched out. 

The derivative of this solution with respect to x 

h A e - X ilAt 

Vi 

is itself a solution, with h spreading out from its initial peak, and plays a significant role in the 
further analysis of this equation. 

The same similarity method can also be applied to some non-linear equations. 


Separating variables 

We can also obtain some solutions of this equation by separating variables. 


Hx, t) = X{x)T{t)=>X”T = XT 


giving us the two ordinary differential equations 


d 2 X 

dx 2 


+ k 2 X = 0 


dT 

dt 


= —kT 


and solutions of the general form 


h{x,t) = Ae kt sm(kx+ a) 
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58 Systems of Ordinary Differential Equations 


We have already examined cases where we have a single differential equation and found several 
methods to aid us in finding solutions to these equations. But what happens if we have two or more 
differential equations that depend on each other? For example, consider the case where 

DfX(t) = 3 y{t) 2 + x{t) t 

and 

D t y{t) = x{t) + y{t) 

Such a set of differential equations is said to be coupled. Systems of ordinary differential equations 
such as these are what we will look into in this section. 

58.1 First order systems 

A general system of differential equations can be written in the form 

D f x = F(x, t) 

Instead of writing the set of equations in a vector, we can write out each equation explicitly, in the 
form: 

D f xi = F\(x\,...,x n , t) 


D t Xi = Fi(xi,...,x n ,t) 

If we have the system at the very beginning, we can write it as: 


where 


D t x = G(x, t) 
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and 


x= (x(f),y(f)) = (x,y) 


G(x, t ) = (3y 2 + xt,x + y) 
or write each equation out as shown above. 

Why are these forms important? Often, this arises as a single, higher order differential equation that 
is changed into a simpler form in a system. For example, with the same example, 

D t x{t) = 3y(t) 2 + x(f) t 


D t y{t) = x(f) + y(f) 

we can write this as a higher order differential equation by simple substitution. 

D t y{t)-y{t) = x(f) 

then 

D t x{t) = 3y(f) 2 + (D f y(f) - y(f)) t 
D t x{t) = 3 y(t) 2 + tD t y{t) - ty{t) 

Notice now that the vector form of the system is dependent on t since 

G(x, t) = (3y 2 + xt, x + y) 

the first component is dependent on t. However, if instead we had 

H(x) = (3y 2 + x, x + y) 

notice the vector field is no longer dependent on t. We call such systems autonomous. They appear 
in the form 

D t x = H(x) 

We can convert between an autonomous system and a non-autonomous one by simply making a 
substitution that involves t, such as y={x, t), to get a system: 
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D t y = (F(y), 1) = (F(x, t), 1) 

In vector form, we may be able to separate Fin a linear fashion to get something that looks like: 

F(x, t ) = /l(f)x + b(f) 

where A( t) is a matrix and b is a vector. The matrix could contain functions or constants, clearly, 
depending on whether the matrix depends on tor not. 

l 


1 http://en.wikibooks.org/wiki/ Category%3ACalculus%2 0%2 8book%2 9 
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59 Real numbers 


59.1 Fields 

You arc probably already familial - with many different sets of numbers from your past experience. 
Some of the commonly used sets of numbers are 

• Natural numbers, usually denoted with an N, are the numbers 0, 1,2,3,... 

• Integers, usually denoted with a Z, are the positive and negative natural numbers: ...-3,-2,- 

1 , 0 , 1 , 2 , 3 ... 

• Rational numbers, denoted with a Q, are fractions of integers (excluding division by zero): -1/3, 
5/1, 0, 2/7. etc. 

• Real numbers, denoted with a R, are constructed and discussed below. 

Note that different sets of numbers have different properties. In the set integers for example, any 
number always has an additive inverse: for any integer x, there is another integer t such that x+t = 0 
This should not be terribly surprising: from basic arithmetic we know that t = -x. Try to prove to 
yourself that not all natural numbers have an additive inverse. 

In mathematics, it is useful to note the important properties of each of these sets of numbers. The 
rational numbers, which will be of primary concern in constructing the real numbers, have the 
following properties: 

There exists a number 0 such that for any other number a. 0+a=a+0=a 

For any two numbers a and b, a+b is another number 

For any three numbers a,b, and c, a+( in- c)=( a+ b)+ c 

For any number a there is another number -a such that a+( -a)= 0 

For any two numbers a and b, a+b=b±a 

For any two numbers a and b,a*b is another number 

There is a number 1 such that for any number a , a*l=l*a=a 

For any two numbers a and lx a*b=b*a 

For any three numbers a,b and c, a(bc)=(ab)c 

For any three numbers a,b and c, a{b+c)=ab+ac 

For every number a there is another number a ] such thataa' 1 -! 

As presented above, these may seem quite intimidating. Flowever, these properties are nothing 
more than basic facts from arithmetic. Any collection of numbers (and operations + and * on those 
numbers) which satisfies the above properties is called a field. The properties above are usually 
called field axioms. As an exercise, determine if the integers form a field, and if not, which field 
axiom(s) they violate. 

Even though the list of field axioms is quite extensive, it does not fully explore the properties of 
rational numbers. Rational numbers also have an ordering.’ A total ordering must satisfy several 
properties: for any numbers a, b, and c 
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if a < b and b < a then a = b (antisymmetry) 
if a < b and b < c then a < c (transitivity) 
a< b or b < a (totality) 

To familiarize yourself with these properties, tty to show that (a) natural numbers, integers and ra- 
tional numbers arc all totally ordered and more generally (b) convince yourself that any collection of 
rational numbers arc totally ordered (note that the integers and natural numbers are both collections 
of rational numbers). 

Finally, it is useful to recognize one more characterization of the rational numbers: every rational 
number has a decimal expansion which is either repeating or terminating. The proof of this fact 
is omitted, however it follows from the definition of each rational number as a fraction. When 
performing long division, the remainder at any stage can only take on positive integer values smaller 
than the denominator, of which there are finitely many. 


59.2 Constructing the Real Numbers 

There are two additional tools which arc needed for the construction of the real numbers: the upper 
bound and the least upper bound. Definition A collection of numbers E is bounded above if there 
exists a number m such that for all x in E x< m. Any number m which satisfies this condition is 
called an upper bound of the set E. 

Definition If a collection of numbers E is bounded above with m as an upper bound of E, and all 
other upper bounds of E arc bigger than m, we call m the least upper bound or supremum of E, 
denoted by sup E. 

Many collections of rational numbers do not have a least upper bound which is also rational, al- 
though some do. Suppose the numbers 5 and 10/3 are, together, taken to be E. The number 5 is not 
only an upper bound of E, it is a least upper bound. In general, there arc many upper bounds (12, for 
instance, is an upper bound of the collection above), but there can be at most one least upper bound. 

Consider the collection of numbers {3,3.1,3.14,3.141,3.1415,...}: You may recognize these deci- 
mals as the first few digits of pi. Since each decimal terminates, each number in this collection is a 
rational number. This collection has infinitely many upper bounds. The number 4, for instance, is 
an upper bound. There is no least upper bound, at least not in the rational numbers. Try to convince 
yourself of this fact by attempting to construct such a least upper bound: (a) why does pi not work 
as a least upper bound (hint: pi does not have a repeating or terminating decimal expansion), (b) 
what happens if the proposed supremunr is equal to pi up to some decimal place, and zeros after (c) 
if the proposed supremunr is bigger than pi, can you find a smaller upper bound which will work? 

In fact, there arc infinitely many collections of rational numbers which do not have a rational least 
upper bound. We define a real number to be any number that is the least upper bound of some 
collection of rational numbers. 


59.3 Properties of Real Numbers 


The reals arc totally ordered. 
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For all reals; a , b, c 
Either b>a, b=a, or b<a. 
If a<b and b<c then a<c 


Also 


b>a implies b+c>a+c 
b>a and o 0 implies bc>ac 
b>a implies -a>-b 

Upper bound a xi om 

Every non-empty set of real numbers which is bounded above has a supremum. 

The upper bound axiom is necessary for calculus. It is not true for rational numbers. 

We can also define lower bounds in the same way. 

Definition A set E is bounded below if there exists a real M such that for all x&isin;F x>M Any M 
which satisfies this condition is called an lower bound of the set E 

Definition If a set, E, is bounded below, M is an lower bound of E, and all other lower bounds of E 
arc less than M, we call Mthe greatest lower bound or iniflmum of E, denoted by inf E 

The supremum and infimum of finite sets are the same as their maximum and minimum. 

Theorem 

Every non-empty set of real numbers which is bounded below has an infimum. 

Proof: 

Let E be a non-empty set of of real numbers, bounded below 

Let L be the set of all lower bounds of E 

L is not empty, by definition of bounded below 

Every element of E is an upper bound to the set L, by definition 

Therefore, L is a non empty set which is bounded above 

L has a supremum, by the upper bound axiom 

1/ Every lower bound of E is <sup L, by definition of supremum 

Suppose there were an e&isin;E such that e<sup L 

Every element of L is <e, by definition 

Therefore e is an upper bound of L and e<sup L 

This contradicts the definition of supremum, so there can be no such e. 

If e&isin;E then e>sup L, proved by contradiction 
2/ Therefore, sup L is a lower bound of E 

inf E exists, and is equal to sup L, on comparing definition of infinum to lines 1 & 2 

Bounds and inequalities, theorems: A Q B => sup A < supB A Q B => inf A > inffi supAu B = 
max(sup A, sup B) inf A u B = min (inf A, inf B) 

Theorem: (The triangle inequality) 


Va,b,ce \quad|a- b\ < \ a- c\ + \c- b\ 
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Proof by considering cases 

If a<b<c then |a-c|+|c-b| =(c-a)+(c-b)=2(c-b)+(b-a)>b-a=lb-al 
Exercise: Prove the other five cases. 

This theorem is a special case of the triangle inequality theorem from geometry: The sum of two 
sides of a triangle is greater than or equal to the third side. It is useful whenever we need to 
manipulate inequalities and absolute values, 
l 


1 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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60 Complex Numbers 


. . /tmp/ 80 .png 


Figure 80: Complex Numbers 1 


In mathematics, a complex number is a number of the form 


a + bi 

where a and b are real numbers, and i is the imaginary unit, with the property i 2 = &minus;l. The 
real number a is called the real part of the complex number, and the real number b is the imaginary 
part. Real numbers may be considered to be complex numbers with an imaginary paid of zero; that 
is, the real number a is equivalent to the complex number a+Oi. 

For example, 3 + 2i is a complex number, with real paid 3 and imaginary paid 2. If z = a + bi, the 
real part ( a ) is denoted Re(z), or (z), and the imaginary paid ( b) is denoted Im(z), or (z). 

Complex numbers can be added, subtracted, multiplied, and divided like real numbers and have 
other elegant properties. For example, real numbers alone do not provide a solution for every poly- 
nomial algebraic equation with real coefficients, while complex numbers do (this is the fundamental 
theorem of algebra). 


60.1 Equality 


Two complex numbers are equal if and only if their real parts are equal and their imaginary parts 
are equal. That is, a+ bi= c+ di if and only if a = c and b= d. 


1 http ://en. wikiversity.org/wiki/Complex%20Numbers 
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60.2 Notation and operations 

The set of all complex numbers is usually denoted by C, or in blackboard bold by C (Unicode ). The 
real numbers, R, may be regarded as "lying in" C by considering every real number as a complex: a 
= a + 0 i. 

Complex numbers arc added, subtracted, and multiplied by formally applying the associative, com- 
mutative and distributive laws of algebra, together with the equation r = &minus;l: 


( a + bi ) + (c + di) = [a + c) + [b + d)i 


(. a + bi )- (c + di) = {a-c) + ( b-d)i 


( a+ bi){c + di) = ac + bci + adi + bdi 2 = ( ac - bd) + {bc+ ad)i 

Division of complex numbers can also be defined (see below). Thus, the set of complex numbers 
forms a field which, in contrast to the real numbers, is algebraically closed. 

In mathematics, the adjective "complex" means that the field of complex numbers is the underlying 
number field considered, for example complex analysis, complex matrix, complex polynomial and 
complex Lie algebra. 


60.3 The field of complex numbers 

Formally, the complex numbers can be defined as ordered pairs of real numbers (a, b) together with 
the operations: 


[a, b) + (c, d) = (a + c,b + d) 


{a, b) ■ (c, d) = [ac - bd, be + ad). 

So defined, the complex numbers form a field, the complex number field, denoted by C (a field is 
an algebraic structure in which addition, subtraction, multiplication, and division are defined and 
satisfy certain algebraic laws. For example, the real numbers form a field). 

The real number a is identified with the complex number (a, 0), and in this way the field of real 
numbers R becomes a subfield of C. The imaginary unit i can then be defined as the complex number 
(0, 1), which verifies 


ia,b) = a - (1,0) + b- (0, 1) = a + bi and i 2 = (0, 1) • (0, 1) = (-1,0) = -1. 
In C, we have: 
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• additive identity ("zero"): (0, 0) 

• multiplicative identity ("one"): (1,0) 

• additive inverse of ( a,b ): (& min us; a, &minus;b) 

• multiplicative inverse (reciprocal) of non-zero (a, by. f 2 +yt > a 2 + b 2 ) • 

Since a complex number a + bi is uniquely specified by an ordered pair (a, b) of real numbers, 
the complex numbers are in one-to-one correspondence with points on a plane, called the complex 
plane. 


60.4 The complex plane 

A complex number z can be viewed as a point or a position vector in a two-dimensional Cartesian 
coordinate system called the complex plane or Argand diagram . The point and hence the complex 
number z can be specified by Cartesian (rectangular) coordinates. The Cartesian coordinates of the 
complex number are the real paid x= Re(z) and the imaginary paid y= Im(z). The representation of 
a complex number by its Cartesian coordinates is called the Cartesian form or rectangular form or 
algebraic form of that complex number. 


60.4.1 Polar form 

Alternatively, the complex number z can be specified by polar coordinates. The polar coordinates 
are r = Izl 0, called the absolute value or modulus, and = arg(z), called the argument of z. For 
r = 0 any value of describes the same number. To get a unique representation, a conventional choice 
is to set arg(0) = 0. For r > 0 the argument is unique modulo 2; that is, if any two values of the 
complex argument differ by an exact integer multiple of 2, they are considered equivalent. To get a 
unique representation, a conventional choice is to limit to the interval (-,], i.e. &minus; < . The 
representation of a complex number by its polar coordinates is called the polar form of the complex 
number. 


60.4.2 Conversion from the polar form to the Cartesian form 


x = rcoscp 


y - r sirup 


60.4.3 Conversion from the Cartesian form to the polar form 


r = 


x 2 + y 


2 
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arc lan (^) 
arctan(|) + n 
arctan (\)-n 

+ 2 

_ E 
2 

undefined 


if x> 0 

if x < 0 and y > 0 
if x < 0 and y < 0 
if x = 0 and y > 0 
if x = 0 and y < 0 
if x = 0 and y = 0. 


The previous formula requires rather laborious case differentiations. However, many programming 
languages provide a valiant of the arctangent function. A formula that uses the arccos function 
requires fewer case differentiations: 


if y > 0 and r ^ 0 
if y < 0 
if r = 0. 

60.4.4 Notation of the polar form 

The notation of the polar form as 


+ arccos 


cp = < 


- arccos 


undefined 


z = r (cos(p+ i sirup) 

is called trigonometric form. The notation cis is sometimes used as an abbreviation for cos + i sin 
. Using Euler’s formula 2 it can also be written as 

z = r e i<p , 

which is called exponential form. 


60.4.5 Multiplication, division, exponentiation, and root extraction in the polar 
form 

Multiplication, division, exponentiation, and root extraction are much easier in the polar form than 
in the Cartesian form. 

Using SUM AND DIFFERENCE IDENTITIES 3 its possible to obtain that 

r, e Up ' ■ r 2 e Upi = n r 2 e a(pl+(p2) 


and that 


2 http://en.wikipedia.org/wiki/Euler%27s%20formula 

3 http://en.wikipedia.org/wiki/List%20of%20trigonometric%20identiti e s%2 3Angle% 
20sum%20and%20difference%20identities 
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rie, ' yi - ri e i^~V2\ 
r 2 e i( P 2 r 2 

Exponentiation with integer exponents; according to DE Moivre’S FORMULA 4 , 

(r e itp ) n = r n e intp . 

Exponentiation with arbitrary complex exponents is discussed in the article on EXPONENTIATION 5 . 

The addition of two complex numbers is just the addition of two vectors, and multiplication by a 
fixed complex number can be seen as a simultaneous rotation and stretching. 

Multiplication by i corresponds to a counter-clockwise rotation by 90° (/2 radians). The geometric 
content of the equation i 2 = &minus;l is that a sequence of two 90 degree rotations results in a 
180 degree ( radians) rotation. Even the fact (&minus;l) • (&minus;l) = +1 from arithmetic can be 
understood geometrically as the combination of two 180 degree turns. 

All the roots of any number, real or complex, may be found with a simple algorithm. The nth roots 
arc given by 


, •( (p+2kn\ 

V re l< P = \fr eH n I 


for k = 0. 1 , 2 ,...,/; &minus; 1 , where \fr represents the principal nth root of r. 


60.5 Absolute value, conjugation and distance 

The absolute value (or modulus or magnitude) of a complex number z= re' is defined as Izl = r. 
Algebraically, if z = a + bu then | z\ = V a 2 + b 2 . 

One can check readily that the absolute value has three important properties: 


if and only if 


\z\=0 
z = 0 


(triangle inequality) 


\z+ w\ < \z\ + \w\ 


| Z- W I = \z\-\w\ 


4 http://en.wikipedia.0rg/w i ki /De%2 0Mo i vre%2 7 s%2 0 formula 

5 http://en.wikibooks.org/wiki/exponentiation 
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for all complex numbers z and w. It then follows, for example, that |1| = 1 and \zl w\ = \z\l\w\. By 
defining the distance function d(z, w) = I z &minus; w I we turn the set of complex numbers into a 
METRIC SPACE 6 and we can therefore talk about limits and continuity. 

The complex conjugate of the complex number z= a + bi is defined to be a &minus; bi, written 
as z or z * . As seen in the figure, z is the "reflection" of z about the real axis. The following can be 
checked: 


z+ w = z + w 


z-w = z-w 


C zl w) = zl w 


z = 


z 


if and only if z is real 


z = 


z 


|z| = |z| 
|z| 2 = z-z 


z 1 = z • I z| 2 

if z is non-zero. 

The latter formula is the method of choice to compute the inverse of a complex number if it is given 
in rectangular coordinates. 

That conjugation commutes with all the algebraic operations (and many functions; e.g. sinz = sinz) 
is rooted in the ambiguity in choice of i (&minus;l has two square roots). It is important to note, 
however, that the function /(z) - z is not complex-differentiable. 


60.6 Complex fractions 


We can divide a complex number (a + bi) by another complex number (c + di) 0 in two ways. The 
first way has already been implied: to convert both complex numbers into exponential form, from 
which their quotient is easily derived. The second way is to express the division as a fraction, then 
to multiply both numerator and denominator by the complex conjugate of the denominator. The 
new denominator is a real number. 


6 http://en.wikipedia.org/wiki/metric%20space 
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a + bi 
c + di 


{a+ bi){c- di) ( ac+bd ) + [be- ad)i 


(c + di)(c- di) 
(ac+bd\ .(be- 


■ + d 2 


+ d 2 


+ l 


ad 1 


c 2 + d 2 


60.7 Matrix representation of complex numbers 

While usually not useful, alternative representations of the complex field can give some insight into 
its nature. One particularly elegant representation interprets each complex number as a 2x2 matrix 
with real entries which stretches and rotates the points of the plane. Every such matrix has the form 

a -b 
b a 

where a and b are real numbers. The sum and product of two such matrices is again of this form. 
Every non-zero matrix of this form is invertible, and its inverse is again of this form. Therefore, the 
matrices of this form are a field. In fact, this is exactly the field of complex numbers. Every such 
matrix can be written as 


a 

b 


-b 

a 



+ b 


-1 

0 


which suggests that we should identify the real number 1 with the identity matrix 


1 0 

0 l] ’ 

and the imaginary unit i with 

0 -T 

1 0 ’ 

a counter-clockwise rotation by 90 degrees. Note that the square of this latter matrix is indeed equal 
to the 2x2 matrix that represents &minus;l. 

The square of the absolute value of a complex number expressed as a matrix is equal to the deter- 
minant of that matrix. 


= (a 2 ) - ((-b)(b)) - a 2 + b 2 

If the matrix is viewed as a transformation of the plane, then the transformation rotates points 
through an angle equal to the argument of the complex number and scales by a factor equal to 
the complex number’s absolute value. The conjugate of the complex number z corresponds to the 
transformation which rotates through the same angle as z but in the opposite direction, and scales in 
the same manner as z; this can be represented by the transpose of the matrix corresponding to z. 
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If the matrix elements are themselves complex numbers, the resulting algebra is that of the quater- 
nions. In other words, this matrix representation is one way of expressing the Cayley-Dickson 
construction of algebras. 

7 


7 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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61.0.1 Calculus/Choosing delta 

This page is an addendum to Calculus /Formal Definition of the Limit j . 

Recall the definition of a limit: 

A number L is the limit of a function f(x) as x approaches c if and only if for all numbers e > 0 there 
exists a number 8 > 0 such that 

|/(x)-L| < e 
whenever 
0 < |x- c\ < 8. 

In other words, given a number e we must construct a number 8 such that assuming 
0 < |x- c\ < 8 


we can prove 


|/(x)-L| <c; 


moreover, this proof must work for all values of e > 0. 

Note: this definition is not constructive — it does not tell you how to find the limit L, only how to 
check whether a particular value is indeed the limit. We use the informal definition of the limit, 
experience with similar problems, or theorems (L’Hopital’s rule, for example), to determine the 
value, and then can prove the correctness of this value using the formal definition. 

Example 1: Suppose we want to find the limit of f(x) = x + 5 as x approaches c = 9. We know that 
the limit Lis 9+5=14, and desire to prove this. 

We choose 8 = e (this will be explained later). Then, since we assume 
|x-9| < 8 


1 Chapter 1 1 on page 79 
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we can show 

|(x + 5) — 14| = |x-9| 

< d , 

= £ 

which is what we wanted to prove. 

We chose 8 by working backwards from the formula we are dying to prove: 
|/(x)-L| <e. 

In this case, we desire to prove 
|x-9| <e. 


given 


|x-9| < <5, 


so the easiest way to prove it is by choosing 8 = e. This example, however, is too easy to adequately 
explain how to choose 8 in general. Lets try something harder: 

Example 2: Prove that the limit of f(x) = x 2 - 9 as x approaches 2 is L = -5. 

We want to prove that 

|/(x)-L| = | x 2 — 4 1 < e 


given 


|x-2| < 8. 

We choose 8 by working backwards. First, we need to rewrite the equation we want to prove using 
8 instead of x: 

| x 2 — 4 1 < e 

\x-2\-\x + 2\ < £ 

(<5) • (5 + 4) - £ 

Note: we used the fact that lx + 21 < 8 + 4, which can be proven with the triangle inequality. 

Word of caution: the above series of equations is not a logical series of steps, and is not part of any 
proof, but is an informal technique used to help write the proof. We will select a value of 8 so that 
the last equation is true, and then use the last equation to prove the equations above it in turn (which 
is what was meant earlier by working backwards). 
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Note: in the equations above, when 8 was substituted for x, the sign < was replaced with =. This 
can be done (but is not necessary) because we are not told that lx-21 = 8, but rather lx-21 < 8. The 
justification for this becomes clear when the above equations arc used in backwards order in the 
proof. 

We can solve this last equation for 8 using the quadratic formula: 

5= _ 4W16 ^ 4 . H _ el = _ 2+v — 

Note: 8 is always in terms of e. A constant value of 8 (e.g., 8 = 0.5) will never work. 

Now, we have a value of 8, and we can do our proof: 
given 

|x-2| < 8, 

| x 2 — 4 1 
|x-2| • |x + 2| 

(5) -(<5 + 4) 

(v / 4 + c-2)-(v / 4 + e + 2)' 

(^4+7) 2 - (2) 2 
£ 

Here a few more examples of choosing 5; try to figure them out before reading the explanation. 
Example 3: Prove that the limit of f(x) = sin(x)/x as x approaches 0 is L = 1. 

Explanation: 

Example 4: Prove that fix) = 1/x has no limit as x approaches 0. 

Example 5 : 

Prove that lim t _2 x 2 = 4 

Solution: To do it, we’ll look at two cases: e > 4 and e < 4. The e > 4 case is easy. First let’s 
let e = 4. That means we want the values chosen in the domain to map to (0, 8) in the range. We 
want a delta such that (2 + <5) 2 = 8 so let’s choose 8 = 2\/2-2. The chosen 8 defines the interval 
(4 - 2\[2,2\[2) ~ (1.1716,2.8284) in our domain. This gets mapped to (24- 16\/2,8) ~ (1.3726,8) 
in our range, which is contained in (0, 8). Notice that 8 doesn’t depend on e. So for e > 4, we widen 
the interval in the range that we are allowed to map onto, but our interval in the domain stays fixed 
and always maps to the same sub-interval in the range. So 8 = 2\f2-2 works for any e > 4. 

Now suppose 0 < e < 4. We want a 8 such that 0 < |x 2 - 4| < e whenever 0 < |x - 2| < 8. So let’s 
assume 0 < |x 2 - 4| < e and work backwards to find a suitable 8: 

0 < \x 2 - 4\ < e 


f{x)-L\ = 

< 

< 

< 

< 


-e < x 2 - 4 < e 
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Appendix 


4-e<x 2 <e + 4 

Since 0 < e < 4, we have 4-e > 0. Since both numbers above are positive, we can take the (positive) 
square root of both extremes of the inequality: 


\/(4 - e) < x < \f(e + 4) 


\/(4 — e)— 2<x — 2< \/[e + 4) — 2 

The above equation represents the distance, either negative or positive, that x can vary from 2 and 
still be within e of 4. We want to choose the smaller of the two extremes to construct our interval. 
It turns out that | \f{e + 4) - 2| < | \f{A - e) - 2| for 0 < e < 4, so choose 5 = \/le + 4) - 2. As a sanity 
check, let’s try with e = 0.002. 


8 = \J\e + 4) - 2 


8= x/(0.002 + 4) - 2 


which is approximately 


8 = 0.0004999375 

At the extreme right of the domain, this gives 


x = 2.0004999375 


and 


X 2 = 2.0004999375 2 = 4.00199999993750390625 

which is within 0.002 of 4. 

2 


2 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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62 Exercise Solutions 


62.0. 2 Precalculus Solutions 

62.0. 3 Convert to interval notation 

1. — 4 < x < 2 (-4,2) 

2. -7/3 <x< -1/3 [-7/3, -1/3] 

3. -n<x<n [~7i,7i) 

4. x < 17/9 (-, 17/9] 

5. 5 < X + 1 < 6 [4, 5] 

6. x— 1/4 < 1 (-, 5/4) 

7. 3 > 3x (-, 1) 

8. 0 < 2x + 1 < 3 [-1/2, 1) 

9. 5 < x and x < 6(-,5) 

10. 5 < x or x < 6(-,6) 


62.0.4 Simplify the following 

1. x 3 + 3x 3 


9 x 3 + 3 x 3 

' r 2 


3. (x 3 + 3x 3 ) 3 


4 x 15 +x 3 

* X 


5. (2x 2 )(3x“ 2 ) 


4x 3 


4x 


64x 9 


x 14 + x 2 


6 
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Exercise Solutions 


6 . 


x y 


7. 


1 

xy 5 


xy 


2 


8 



1/3 


2x 2 

4 ~ 

y* 


62.1 Functions 

1. a) / (0) = 0, /(2) = 4 

b) The domain is (- 00 , 00 ); the range is [0,oo), 

c) No, since / isn’t one-to-one; for example, /(- 1) = /( 1) = 1. 

2. a) i. (/ + g)(x) = X + 2+ 1/x = (x 2 + 2x + l)/x. 

ii. (f-g)(x) = x + 2- 1/x = (x 2 + 2x- l)/x. 

iii. (g-/)(x) = llx-x-2- (1-x 2 -2x)/x. 

iv. (/xg)(x) = (x + 2)/x. 

v. (/7g)(A) = x(x + 2) provided x^O. Note that 0 is not in the domain of //g, since 
it’s not in the domain of g, and you can’t divide by something that doesn’t exist! 

vi. (g//)(x) = l/[x(x+2)]. Although 0 is still not in the domain, we don’t need to state 
it now, since 0 isn’t in the domain of the expression l/[x(x + 2)] either. 

vii. (/og)(x) = l/x + 2 = (2x+ 1) / x. 

viii. (go/)(x) = l/(x + 2). 

b) /(g(2)) = 5/2; g(/(2)) = 1/4. 

c) Yes; / -1 (x) = x-2 and g _I (x) = 1/x. Note that g and its inverse are the same. 

3. As pictured, by the Vertical Line test, this graph represents a function. 


62.1.1 Limits Solutions 

62.2 Basic Limit Exercises 

(1) lim x ^2 (4x 2 - 3x + 1) 


Since this is a polynomial, two can simply be plugged in. This 
results in 4 (4) -2 (3) +1=16-6+1=11 
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One Sided Limits Exercises 


(2) lim x ^ 5 (x 2 ) 


25 


62.3 One Sided Limits Exercises 

1. lim x ^o- x 3+2x 2 

Solution: Factor as In this form we can see that there is a removable discontinuity at x=0 

and that the limit is | 


62.4 Two Sided Limits Exercises 


( 3 ) lim.r— 4 -^4 

lim x-4+ 

+ 

(4 ) D . N . E . 

(5) -6 

( 6 ) 6 

(7) 3 

(8) 13/8 

(9) 10 

(10) D.N.E. 

(11) +infinity 


62.5 L’HopitaTs rule 

1 . 2 
2 . -1 
3 - 

4 

4. 0 

5. -2 


62.5.1 Differentiation Solutions 


62.6 Find The Derivative By Definition 


1. 2x 


fix) 


= lim Ax ~o tx+A A f ~ x2 

_ i;™ x 2 +2xAx+Ax 2 -x 2 

- tint Ax— >0 Ax 

_ l;_, 2xAx+Ax 2 

- limax-0 Ai 

= limAx^o2x + Ax 
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Exercise Solutions 


= 2x 


2 . 2 


fix) 

fix) 


— 2x + 2 

_ i • [. 2(x+Ax)+2]-(2x+2 ) 

— nni/\ x -^o 

i; w 2x+2Ax+2 — 2.x — 2 

“ lim A*-*0 Ax 

= lim A x^o ^ 


3. x 

fix) 

fix) 


_ I„2 

- 2 A 


= limax-o - 


|(x+&x) 2 -|x 2 


lAx^O ' 


= lim 


Ax^O 


lim Ax - 


Ax 

\ (x 2 +2xAx+ Ax 2 ) — | x 2 
AX 

, 2xAx , \j ^ _ x 2 
2 + 2 + 2 2 

Ax 

2xAx+Ax 2 


2Ax 

= lintAx^o x + Ax 


= x 


4. 4x + 4 

/(x) =2 x 2 + 4x + 4 

y/ ^ _ Jj mA Q [2 (x+Ax) 2 +4(x+Ax)+4]-(2x 2 +4x+4) 

_ is— 2 (x 2 +2xAx+Ax 2 )+4x+4Ax+4-2x 2 -4x-4 

- limAx^O Ax 

_ u_, 2 x 2 +4xAx+2Ax 2 +4Ax-2x 2 

- lintAx^o — 

- liniAx^o Ax 

= lintAx^o 4x + 2 Ax + 4 
= 4x + 4 


62.7 Prove Differentiation Rules 


62.7.1 Proof of the Derivative of a Constant Function 

If c = fix), then 

f x [c) =f'ix) 

= lim A x V (X+A AT /M 
= lim A x^o c ~zf 
= lintAx-o 0 
= 0 


62.7.2 Proof of the Derivative of a Linear Function 
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Find The Derivative By Rules 


If mx + b = fix), then 
-j x (mx + b) 


= fix) 

= lim A ^o /U+A A t /W 

_ u™ [m(x+Ax)+b]-[mx+b] 

- lim Ax _ 0 Ax 

_ !•_. mx+mXx+b-mx-b 

- HniAx-{) T7 

- lim mAx 

~ llm Ax-0 "aJ- 

= lim A x-o tn 
= m 


62.7.3 Proof of the Constant Multiple Rule 

i\ cm] =iim 

= clim ai .o feMVM 

= [/W] 


62.7.4 Proof of the Addition and Subtraction Rules 


£[fix)±g{x)] 


•0 ■ 


•0 ■ 


[/(x+Ax)+g(x+Ax)]-[/(x)+g(x)] 
Ax 

/(x+Ax)+g(x+Ax)-/(x)-g(x) 

Ax 

/(x+Ax)-/(x) g(x+Ax)-g(x) 
Ax + 

J(x+/S.x)-f(x) 

Ax 


= lim Ax _ 

= lim Ax _ 

= lim Ax _ 

= lim Ax _ 

= 37 [/I-<1] + A 


Ax 


+ 


62.8 Find The Derivative By Rules 


1. 4x 
2 - 2 — 

2 -2 .1. 

4 I — — 

5. cos(x) -sin(x) 

6. 2(x + 5) 


7. 

8 . 


X 

v222? 

2V2x 2 + 1(3x 4 + 2x)(12x 3 + 2) + 


2x(3x 4 +2x) 

?2x 2 +1 


9. ln(4)4 x 
10 . 


9x 


2 \2? 

11 -^ + 2 

1 1 ' xln4 ^ x 


+ 3ln(2)Vx 3 — 2(2 X_3 ) + \ 


12. 3e x + 4sin(x) - ^ 

13. 10x 4 + 16x+ 1 

14. 49x 6 + 40x 4 + 3x 2 + 2x — 1 


2 
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Exercise Solutions 


62.9 Logarithmic Differentiation 


1. 

2 . 

3. 



3 A' 2 

4(l-x 3 ) , 


dy _ y r i , i i 

dx 2 1 1+x 1-jcl 


g = 2{ln{x) + ln{2) + 1 ) 


4. g = (x 3 + 4x) 3x (9x 3 + 3 x 3 (Z»(x 3 + 4x)) + 12x(/n(x 3 + 4x)) + 3x 2 + 12x + 4) 


62.10 Implicit Differentiation 

Recall that g is the same as y' . 

1. 

g(x 2 + y 2 = l) 

2x + 2yf x = 0 
soive for g 

dy _ _x 
dx y 

2 . 

g(x 3 + y 3 = xy) 

3x 2 + 3y 2 g = y + xg 

solve for g 

3x 2 - y = g (x - 3y 2 ) 

dy _ 3x 2 -y 
dx ~ x-3y 2 

62.1 1 Higher Order Derivatives 

1. 36x 2 + 6 
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Solutions to Set One 


62. 1 1 . 1 Integration Solutions 


62.12 Solutions to Set One 


62.13 Solutions to Set Two 


1. ^ — ^+4 x+C 

2. 2x 4 + C 

3. |x 3 + ^x 4 + C 

4. |ix 33 + x 4 -|x 5 + C 

5. -5x _1 + C 


62.14 Solutions to Set Three 


1. / (cosx + sinx)dx 


cosxdx+ / sinxrfx 


2. / 3sinxdx 


3. / l + tan 2 xdx 


4. -tanx+ C 

5. -e x + C 

6. 8e x + C 


= sinx-cosx +C 


= 3 x J si 


sinxdx 


= -3cosx + C 


/ 


= I sec z xdx 


= tan x + C 
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Exercise Solutions 


7. ^ln|x| + C 

8. with the substitution x = ntan0, we have dx = asec 2 6 d9, and x 2 + a 2 = a 2 sec 2 9, so that 


/ 

-I 


1 

x 2 + a 2 


dx 


a sec 2 6 d6 
a 2 sec 2 9 


/ 


-d9 

a 


= -9 + C 
a 


1 x 

= — arctan — + C 
a a 


l 


1 http://en.wikibooks.org/wiki /Cate gor y%3ACalculus%2 0%2 8book%2 9 
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64 Table of Trigonometry 


Trigonometricjdentity 1 


64.1 Definitions 


* tan(x) = ^| 

* sec M = drx 

* cot(x) = fgf 

* CSC M = ii 5i 


1 

tanx 


64.2 Pythagorean Identities 

• sin 2 x + cos 2 x = 1 

• 1 + tan 2 (x) = sec 2 x 

• 1 + cot 2 (x) = csc 2 x 


64.3 Double Angle Identities 


• sin(2x) = 2 sin x cos x 


• cos(2x) = cos 2 x - sin 2 x 


• tan(2x) = 

• cos 2 (x) = 

• sin 2 (x) = 


2tan(x) 
1-tan 2 (x) 
l+cos(2x) 
2 

l-cos(2x) 

2 


64.4 Angle Sum Identities 


sin(x + y) = sinxcosy + cosxsiny 


sin(x-y) = sinxcosy- cosxsiny 


1 http ://en. wikipedia.org/wiki/Trigonometric_ identity 
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Table of Trigonometry 


cos(x + y) = cos x cos y- sin x sin y 


cos(x-y) = cosxcosy + sinxsiny 


(X + y\ (X-y 
sinx + siny = 2sin^ — - — Jcos^ — — — 


(X + y\ x-y 

sinx- siny = 2cos^ — - — J sin — - — 


rx + y\ (X-y 

cos x + cosy = 2 cos cos 

J 2 > 2 


. (X + y\ . (X-y 

cos x- cosy = -2 sin sin 

2 > 2 


sin (x + y) 

tanx + tany= 

cos x cosy 


sin (x - y) 

tanx-tany = 

cos x cosy 


sin (x + y) 

cotx + coty = 

sin x sin y 


- sin fx - y) 

cotx - coty = 

sinx siny 


64.5 Product-to-sum identities 


cos (x) cos (y) = 
sin (x) sin (y) = 
sin (x) cos (y) = 
cos (x) sin (y) = 


cos (x + y) + cos (x - y) 


cos| 

[x-y] 

2 

-cos 

(*+y) 

sin 

2 

(x + y) + sin | 

{x-y) 

sin 

2 

(x + y) - sin | 

[x-y] 

2 
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65 Summation notation 


Summation notation allows an expression that contains a sum to be expressed in a simple, compact 
manner. The uppercase Greek letter sigma, Z, is used to denote the sum of a set of numbers. 

Example 


£ i 2 = 3 2 + 4 2 + 5 2 + 6 2 + 7 2 

i = 3 

<blockquote style="background: white; border: lpx solid black; padding: lem;"> Let /be a func- 
tion and N,M arc integers with N < M. Then 

M 

X /( i) = f(N] + f{N+ 1) + /(1V+2) -I 1- /(M). 

i=N 

We say IV is the lower limit and Mis the upper limit of the sum. </blockquote> 

We can replace the letter i with any other variable. For this reason i is referred to as a dummy 
variable. So 


4 4 4 

X i = X ^ = X a “ 1 + 2 + 3 + 4 

i = 1 ;'=1 a=l 

Conventionally we use the letters /, / /c, m for dummy variables. 
Example 


5 

i — 1 + 2 + 3 + 4 + 5 

(=1 

Here, the dummy variable is i, the lower limit of summation is 1, and the upper limit is 5. 
Example 

Sometimes, you will see summation signs with no dummy variable specified, e.g., 

£ z' 3 = 100 

l 
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Summation notation 


In such cases the correct dummy variable should be clear from the context. 

You may also see cases where the limits arc unspecified. Here too, they must be deduced from the 
context. 

65.0.1 Common summations 

Summation#Capital-sigma notation 1 
£" =1 c = c + c + ... + c= nc,ce R 
I? =1 i = 1+2 + 3 + ... + n= 

I" =1 i 2 = l 2 + 2 2 +3 2 + ... + n 2 = ,i(,i+1) 6 (2 ” +1) 

I? =1 i 3 = l 3 + 2 3 + 3 3 + ... + n 3 = 

2 


1 http://en.wikipedia.org/wiki/Summation%23Capital-sigma%20notation 

2 http://en.wikibooks.org/wiki /Cate gory%3ACalculus%2 0%2 8book%2 9 
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66 Tables of Integrals 


66.1 Rules 

• / cfix) dx= cf f{x) dx 

• f f{x) + g(x)dx = f f(x)dx + f g(x)dx 

• f fix)- gix)dx = f f{x)dx- f gix)dx 

• f udv = uv- f vdu 


66.2 Powers 


• f dx = x+ C 

• f adx= ax + C 

• fx n dx= ~^r[X n+l + C if n #-l 

• /i dx = In | jc| + C 

' h[kb dx = l a Xn \ax + b\ + c 

66.3 Trigonometric Functions 

66.3.1 Basic Trigonometric Functions 

• fsinxdx = -cosx + C 

• fcosxdx= sinx + C 

• ftanxdx = ln|secx| + C 

• fsin 2 xdx= \x- |sin2x + C 

• fcos 2 xdx= \x + |sin2x + C 

• ftan 2 xdx = tan(x) -x + C 

• / sin” xdx=- sin ' 1 ~ 1 > f cosx + ^ f sin”" 2 xdx+C (for n > 0) 

• / cos' 1 x <ix = — n,s " t ; rsin - r + n^f cos”" 2 x dx + C (for n > 0) 

• ftan n xdx = ^yytan ,,_1 x- /tan' !-2 xdx + C (for n ^ 1) 


66.3.2 Reciprocal Trigonometric Functions 

• f secxdx = ln|secx + tanx| + C = ln|tan(|x+ \n) \ + C 

• f esc xdx = -ln|cscx + cotx| + C = ln|tan(|x)| + C 

• f cot xdx = ln|sinx| + C 
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Tables of Integrals 


• /sec 2 kxdx = |tan kx + C 

• f esc 2 kxdx = -jeotkx + C 

• f cot 2 kxdx = -x- jeotkx + C 

• fsecxtanxdx= secx + C 

• fsecxcscxdx = ln|tanx| + C 

• / sec n xdx- sec "^_/ smx + j S ec n ~ 2 xdx + C (for n ^ 1) 

• / esc” xdx = - + n=±f esc”" 2 xdx + C (for n ? 1) 

• fcot n xdx= -^/yCot" _1 x-/cot' 7_2 xrfx+C (for n # 1) 


66.3.3 Inverse Trigonometric Functions 


* / 
* / 


t== dx = arcsin(x) + C 

!\-X l 
l 


\faF~- 


■ dx = arcsin(x/ a) + C 


• f dx = arctan(x) + C 

* f .1. v 2 dx = ^arctan [x/a) + C 

** a i jc ( f 


if a # 0 
if a ^ 0 


66.4 Exponential and Logarithmic Functions 

• f e x dx = e x + C 

• fe ax dx= k e ax + c if a^O 

• fa x dx=^a x + C ifa>0, a^l 

• f\nxdx = xlnx- x+ C 


66.5 Inverse Trigonometric Functions 

• / arcsin(x) dx = xarcsin(x) + VI - x 2 + C 

• / arccos(x) dx = xarccos(x) - Vl- x 2 + C 

• f arctan(x) dx = xarctan(x) - ^ ln(l + x 2 ) + C 


66.6 Further Resources 

Lists of integrals 1 


1 http://en.wikipedia.org/wiki/Lists%20of%20integrals 
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67 Tables of Derivatives 


Table of derivatives 


67.1 General Rules 


s ( / + s r) - S + l X 

£(cf)=c d / x 

"</*) = /a f + si 


dx 

A 

dx 


( 9 - 


„rf/_ f rfg 
o dx J dx 


67.2 Powers and Polynomials 


ii x = 1 

d _ „ v «-l 
dx x “ nx 

7ev / x = 27t 

_rf_ j_ _ _ j_ 

dx x x ' 2 

j^(c n X n + C„_iX” _1 + C n - 2 X n ~ 2 + 


+ c 2 x z + c\x + Co) = nc n x n 1 + {n - \)c n -\x 


2 ) C n - 2 X n 3 H + 2C2X+C1 


67.3 Trigonometric Functions 

■ffc sin(x) = cos(x) 

J^cos(x) = -sin(x) 

^tan(x) = sec 2 (x) 

^cot(x) = -csc 2 (x) 
sec(x) = sec(x) tan(x) 
csc(x) = - csc(x) cot(x) 


1 http://en.wikipedia.org/wik i /Table_of_deri VAT I VE s 


2 + [n - 
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Tables of Derivatives 


67.4 Exponential and Logarithmic Functions 


d_ p x _ x 
dx e ~ e 

~^.a x = a x ln(a) 

E>"W = ; 


1 


xln (a) 


if a > 0 


if a > 0, a ^ 1 


( / & )'=K n/ y=/ g (/'f +g' ln /) f 

(</ )' = (e^ lnc ’) , = f'cf Inc 


/> 0 


67.5 Inverse Trigonometric Functions 


s ai ' csinx= 7f=i 
^arccos x = -y= 

^arctanx= I ^ ? 

_rf_ r _ i 

dx. 


|x|Vx 


dx* 1+x 2 
-1 


dx* 


|x|Vx 


67.6 Hyperbolic and Inverse Hyperbolic Functions 


— sinhx = coshx 
dx 


— coshx = sinhx 
dx 


— tanhx = sech 2 x 
dx 


— sech x = - tanh x sech x 
dx 


— cothx= -csch 2 x 
dx 


— cschx= -cothxcschx 
dx 


-j- sinh 1 x = 1 

dx vx 2 + 1 

-j- cosh* 1 x = 1 

dx Vx 2 -1 
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Hyperbolic and Inverse Hyperbolic Functions 


u .-1 
— tanh x = 
ax 


1 

1-x 2 


-^-sech l x = — ^ 
dx xv 1-x 2 


U i 

— coth x = 
ax 


-1 

1-x 2 


-^-csch 1 x = 1 

dx \xW\ + x 2 
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